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PSYCHOLOGICAL CLINIC FROM PITTSBURGH 


EDITORIAL INTRODUCTION 


In applied clinical science, it is of great value to disseminate information con- 
cerning what is going on and the latest developments at important centers of practice 
and research over the country. For many years, Surgical Clinics of North America 
have served the function of describing the latest practices at important medical 
centers. It seems desirable for the profession of clinical psychology, too, to develop 
media for the exchange of information and data concerning what is going on in other 
areas of this country on some organized basis. 


To this end, your Editor early in 1958 invited clinical psychologists located in 
representative cities across the country to organize a special series of symposia to be 
known as ‘Psychological Clinics’ describing current clinical and basic research 
developments in their areas. A number of clinical psychologists responded with 
acceptable proposals for special ‘‘Clinics’’ and were designated as cooperating guest 
editors. Our guest editors were given the instructions to (a) collect a representative 
cross-section of research rather than programs in their areas, (b) encourage authors 
to write terse operational accounts of their work, and (c) to include both basic 
research reports and clinical case studies. 


Dr. Roy M. Hamlin, of the University of Pittsburgh and the Western Psy- 
chiatric Institute and Clinic at Pittsburgh, was the first to respond with a bal- 
anced symposium of significant studies. This issue contains the first ‘“Psychologi- 
cal Clinic” describing current work in the Pittsburgh area. The format of this first 
area report is frankly experimental in design. Dr. Hamlin feels that the articles 
do not attain any too well the objectives outlined by the editor. Representative 
coverage could not be achieved. Rather than emphasize research from strictly clinical 
settings, the selection aimed at diversity of approaches. The American Institute of 
Research, the Veterans Administration, a family center, a clinical laboratory, a 
hospital, a psychosomatic clinic, a speech department, an animal laboratory, and a 
selection program are represented. But Pittsburgh studies of brain implantation, of 
sensory deprivation, of adrenalin and emotion, of learning in defectives, of childhood 
schizophrenia, of aging, and many others are not included. 


The order of the articles progresses from more clinical or case studies, to cor- 
relational and technique development studies, to laboratory research. Two of the 
case studies lay the groundwork for more controlled research approaches. And the 
final animal study prepares the way for research with human subjects, and perhaps 
for clinical applications. The cycle is complete. 


It is hoped that this initial experiment will pave the way for a long series of 
“Clinics” from all the important centers of the world. Clinical psychologists who 
feel they are in a position to sponsor future ‘‘Clinics’’ from their areas are invited to 
submit proposals to the Editor. The only restrictions as to content or format are 
that the articles should report significant new research developments and clinical 
practices rather than simply being reports of programs in progress of purely local 


interest. ‘ 
Roy M. Hamurn, Guest Editor 
Freperick C. THorn#, Editor 





INTERVIEW AND TEST DATA IN A DIAGNOSTIC PROBLEM: 
A CASE REPORT 


MARCIA SWARTZ AND EVA D. FERGUSON 
Western Psychiatric Institute and Clinic, University of Pittsburgh 


PURPOSE 

This paper reports a diagnostic problem the solution of which was facilitated 
by psychological evaluation. The subject, a 57-year-old white woman, divorced and 
a retired school teacher, presented a long history of illness attended by many exam- 
inations for organic disease, state hospitalization, psychotherapy and electroshock. 
This history of chronicity plus the physician’s initial impression of the patient sup- 
ported the assumption that her difficulties were largely “functional”. The data 
acquired during the course of treatment prompted reassessment of the case. Psycho- 
logical tests were requested and, combined with the data of the interviews, led to the 
identification of an endocrine factor of major significance. The purpose of this paper 
is to elucidate the diagnostic difficulties, and to describe the specific test data upon 
which diagnostic conclusion was based. 

A brief clinical history will accompany the discussion of the diagnostic problem. 
A dynamic formulation of this material will not be presented, but rather the data of 
the case itself, so that the clinical situation and the accompanying difficulties are 
clear. 


METHOD 


The weekly psychotherapeutic interviews totaled 30 hours. At the beginning of 
treatment, the attending physician performed a routine physical examination. Upon 
referral from the therapist, the psychologist administered the following tests: Early 


Recollections, Human Figure Drawings, WAIS, Rorschach, and Wechsler Memory 
Scales, Form I. 


THERAPY INTERVIEWS 

The patient came to our clinic in search of a cure for an illness characterized by 
a multitude of vague symptoms which had endured for eight years despite extensive 
medical treatment. She discussed in monotonous detail, tightness in her chest, loss of 
balance, loss of hearing, stiffness in all body joints, vague shooting pains traversing 
her entire body, transitory swelling in fingers and legs, and fatigue. Disturbed by her 
failures of concentration and memory, she withdrew her interest from the life around 
her. 

Although the patient had been plagued by various aches most of her adult life, 
the above symptoms had not appeared until a few weeks after an hysterectomy in 
1950. Since that time, she had undergone many examinations, tests such as cardio- 
grams and measurements of basal metabolism, and many treatments including 
estrogen, electroshock, and a few months of psychoanalysis. Improvement would 
occur but be temporary, and this fluctuation in symptomatology could not be re- 
lated to incidents of special significance in the patient’s life. The patient, unable to 
continue her work, resigned her teaching position and voluntarily entered a State 
Hospital where she stayed for one year. While there, she made one ineffectual suicide 
attempt. After discharge, she again made the rounds of medical offices and finally 
appeared in our clinic. 

Physical examination revealed only hypertension, the blood pressure being 
170/90, a condition diagnosed eighteen years previously. Early in treatment, the 
therapist was tempted to label this drab, clinging, depressed woman a hypochondriac 
who had stubbornly resisted competent medical assistance. Observation of the 
patient did not allow of such dismissal. The patient seemed intelligent and well- 
oriented but, on the other hand, exhibited many bizarre mannerisms such as fondling 
her body as she complained and pulling at her chest as she breathed. She dressed 
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carelessly; she expressed affect at times inappropriately; and she produced super- 
ficially logical but fanciful descriptions of body physiology to explain the symptoms 
which were suggestive of many disease entities but typical of none. In brief, con- 
sideration of a diagnosis of psychosis was reasonable. As therapy progressed, the 
patient began to dress more carefully and to express her feelings more appropriately 
but the therapist held fast to these earlier suspicions. 

Prior to 1950 the patient had been a compulsive, conscientious and energetic 
school teacher who supported herself while securing an education. This independ- 
ence was highly valued for as a child she had been severely restricted by a chronically 
ailing, yet domineering mother whom she silently resented. In her early years, she 
was rebelliously tomboyish and outgoing; but with the onset of puberty, she began 
to withdraw from others and had few friends. Despite her great effort, her school 
grades were only fair. 

At 21, she married a man who disliked work and responsibility but whom the 
patient schooled and partially supported. A year following this marriage, the 
patient’s father died of pernicious anemia. During his illness, the patient contrib- 
uted to the support of the family and continued to be their financial mainstay until 
her illness. Her own marriage was beset with financial difficulties; sexual adjustment 
was poor for the entire eleven years of the marriage, and three pregnancies were 
terminated by the patient. The husband demanded a divorce, but the patient, al- 
though cognizant of his greedy dependency and of his extramarital affairs, refused. 
However, an out-of-state divorce was granted him. Ten years passed before the 
patient felt she had recovered from the depression following their separation. The 
patient’s mother died in 1944 in a state mental institution of confused senility but 
before her death had symptoms which the patient describes as identical to those 
she now presents. 

As the patient ventilated her shame over her enforced dependency and hostility 
toward her family, she appeared less depressed, dressed more appropriately, and 
spoke and moved with more energy. Suddenly - for no discernible reason - she again 
became depressed with the symptoms increasing in intensity. Psychological tests 
were requested because of (a) the initial psychotic appearance of patient, (b) the past 
history of resistance to various treatment, (c) the ambiguity and bizarre nature of the 
complaints, and (d) the erratic fluctuations of the symptoms before and during the 
therapy. The therapist was faced with the question of whether some organic dys- 
functioning was responsible for this increased impairment or whether the ego was 
failing in its function and the patient turning to psychosis. The problem was not 
diminished by the realization that both factors could be operative. The results of 
the psychological examination follow. 


PsYCHOLOGICAL EXAMINATION 

When first seen, the patient gave an impression of eccentricity which diminished 
on further contact. Unkempt attire, scratching, automatic complaints of weariness, 
diverse coquettish arm and body movements, thick speech, and a general air of dis- 
organization and limited capacities contributed to the initial picture. Lip movements 
seemed heavy and lacking in dexterity, and she moved her mouth in persistent, man- 
neristic patterns. In recounting Early Recollections, she could not adapt to the ex- 
aminer’s pace, compulsively relating events in a rapid, unmodifiable way. During 
br Figure Drawings, she prattled like a young child commenting on what she was 

oing. 

As the tests proceeded, however, she became increasingly relaxed, appropriately 
cooperative, friendly, and even out-going, though maintaining an easy professional 
air. Moreover, she arrived for a second testing session attractively and neatly 
dressed, and generally showed an acceptable degree of poise and efficiency. 

The most conspicuous overall feature of her test performance reflected the same 
disparity as the behavior just described. Essentially well-integrated and intelligent 
in her general test performance, she nevertheless showed frequent, sometimes ex- 
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treme, lapses in functional level. The most characteristic and striking discrepancies 
consisted of variation, from a high degree of creativity or conceptualization to a level 
of marked literalness. For instance, many Rorschach responses showed superior 
organization and form level, originality, and at times, capacity to integrate shading 
or color with form and movement. In contrast, she gave many perseverative and 
stimulus-bound responses, the most frequent being indiscriminate ‘“‘maps’’ to dark- 
light shaded areas, and ‘‘claws’’ to tiny, hooked details. Occasionally, her usual 
ability to integrate stimuli seemed to desert her, so that she would account for one 
tiny detail, then another, and only later integrate these into an adequate response. 
This fumbling procedure contrasted sharply with an easy ability to integrate 
other, often complex and original, responses. 

The same pattern, of stimulus-bound in contrast to conceptually alert responses, 
appeared throughout other tests. Her slow, uncertain, and circumlocutious perform- 
ance on Picture Arrangement gave a weighted score of 7 in contrast to her highest 
weighted score of 17 on Comprehension and to the total WAS 1Q of 119. Her learn- 
ing and recent memory on the Wechsler Memory Scale highlighted the unevenness of 
her performance by showing much more confusion than would be expected from the 
general level of functioning found on the various tests. 

The striking disparity so far described constituted the starting point for the 
remainder of the psychological analysis. Extreme disparity could suggest psychosis, 
severe neurosis, brain-damage, or other factors. The examiner utilized the following 
hypotheses: (a) if the primary etiology was a functional one, severe disturbances in 
feeling, in thought and in general personality integration should be reflected in the 
test data; (b) if brain damage determined the degree and pervasiveness of the un- 
even functioning, the damage should be sufficient to produce some of the more com- 
mon signs. The test data supported neither of these possibilities. 

The Rorschach alone, with the productive and generally competent handling of 
72 responses, makes the probability of psychosis or severe neurosis unlikely. With a 
wealth of verbalization increasing the likelikood that any psychotic tendency would 
be revealed, she gave no bizarre or really inappropriate responses. Neurotic indica- 
tions of depression, anxiety, cautious checks on affect, and the like do appear, but 
essentially come within the range for normal subjects, certainly not suggesting 
severe neurotic incapacitation. Figure Drawings, WAIS performance, and Early 
Recollections support the same logic. The confusion and disparities which occur have 
a sporadic character, and do not suggest personal distortion so much as uncertainty 
about stimulus structure and adequacy of response. Nor does the uncertainty carry 
with it the burdensome tension of pervasive neurosis, 

Signs of brain damage do not stand up convincingly on inspection. The poor 
memory on the Wechsler Memory Scale paralleled her complaints of impaired con- 
centration and memory, and a lowering of performance on speed tests provided one 
possible sign of brain damage. But she displayed sustained effort, alert integration 
and recall on other tasks, such as Rorschach and Early Recollections. The general 
richness of her overall performance and the high conceptual level she was able to 
maintain on such tasks as Block Designs and Similarities did not support the usual 
picture of brain damage. Thus, the variability of intellectual functioning that did 
appear suggested the possibility of some kind of organic (neurological and /or physiol- 
ogical) dysfunction, but other aspects of her test performance discounted the likeli- 
hood of brain damage. 

A final consideration, influenced chiefly by the patient’s Early Recollections, led 
the psychologist to reject the probability that the patient’s symptoms were on a pri- 
marily functional basis; the presence of some kind of organic dysfunction, of an un- 
specified nature, was consequently taken as the most tenable diagnosis. The patient 
gave 12 Early Recollections, all essentially revealing the same pattern: a basically 
assertive, energetic, fairly comfortable woman, who likes to explore life and live it 
fully, albeit more according to her own whimsical and independent spirit than ac- 
cording to careful concern about rules and propriety. For example, she recalled hav- 
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ing wanted to go to school before the age of five and being allowed to go when she 
turned five. Her mother asked: ‘‘“Now what will you tell the teacher?”’ ‘‘I’ll tell her 
I’m five, but I want to come to school.’”’ She went by herself to school, and the 
teacher told her she wasn’t old enough, but could stay till noon. She accepted the 
concession of staying till noon, with no recollection of resentment: “‘I had just found 
out for sure that they wouldn’t take me.” 

The twelve memories together gave a picture of a woman who views life as 
interesting and challenging, who tries things and wants to “find out’’, who may skim 
over hostile feelings in herself and in others, but in a rather serene way. The clinical 
picture and initial behavior during psychological examination seemed completely out 
of keeping with the Life Style this woman revealed in her Larly Recollections. The 
general solidity of the Rorschach confirmed a strength of personality which could not 
logically lead to the drab, clinging, disorganized symptom picture presented, on a 
functional basis, i.e., on a basis of primarily satisfying personal needs. Since brain 
damage of a more devastating nature does not find convincing support, the most 
plausible explanation might come from consideration of some other organic dysfunc- 
tion, accounting for unevenness in perceptual and intellectual] functioning as well as a 
variety of other effects. 

The psychological report therefore minimized the probability of psychosis, or 
of any primarily functional disturbance; and favored rather a loss of mental effi- 
ciency of unspecified, probably-non-brain damage, origin, with secondary functional 
manifestations. Subsequent discussion with the psychiatrist brought out that some 
of the performance losses and patterning could suggest endocrine dysfunction. 


DIscUSSION 

Armed with the knowledge provided by the psychologist, the therapist was able 
to take appropriate steps for differential diagnosis. Hypertension had been present 
for many years but in the absence of neurological changes, brain damage due to a 
series of ‘“‘small strokes’? was discounted. A more encompassing process, perhaps 
endocrine in nature, seemed in evidence. Despite the fact that the patient’s basal 
metabolism measurements had been within normal limits previously and despite the 
fact the lethargy and depressed tendon reflexes characteristic of hypothyroidism 
were not present, thyroid function was again investigated. At this time, the blood 
chemistries* were diagnostic of hypothyroidism and the cardiogram compatible 
with this diagnosis. The patient was placed on a grain of thyroid extract daily with 
marked improvement. A further evaluation of the patient’s psychological function- 
ing must, of necessity, be postponed until homeostasis is once more established. Such 
an evaluation does not lie within the scope of this paper, but rather, this material 
illustrates the importance of the psychological examination in the differential 
diagnosis of this case. 


SUMMARY 

This case history of a 57-year-old white, divorced school teacher has been pre- 
sented as a problem in diagnosis of a chronic illness resistant to extensive treatment 
and labeled as functional. Considerations in the course of treatment led to a re- 
assessment of the patient with psychological tests being utilized as an aid in this 
endeavor. The psychological examination, complementing the data derived from the 
interviews, facilitated the solution of the diagnostic problem with the identification 
of a major organic factor, hypothyroidism. 


*1. Protein Bound Iodine 1.6 gamma % (Normal range 4 - 8 gamma %) 
2. Total Blood Cholesterol 520 mgm % (Normal range 150 - 250 mgm %) 





EFFECT OF TRADITION ON HUSBAND CHOICE 
AND ROLE AS WIFE: A CASE REPORT 


HELENE BORKE AND MARGARET B. MC FARLAND 
Arsenal Family and Children’s Center, University of Pittsburgh 


The present report deals with one facet of a more extensive three generation 
case analysis of the Maslowskis, a Polish family. Selected material will limit con- 
sideration to the effect of tradition on the three Maslowski sisters in their choice of 
husbands and in their perception of themselves as wives and mothers. The concept of 
tradition includes national tradition (in this instance, Polish), local tradition (resi- 
dence in a Polish American community), and family tradition (patterns of values, 
goals and ways of relating characteristic of this particular family). 


SETTING 


The data were collected at the Arsenal Family and Children’s Center, a division 
of the Arsenal Public Health Center, which has three major approaches as a family 
service: a nursery school for preschool children, group work with school age children 
of the same families, and both individual and group work for the parents. The 
particular direction taken by these activities has been influenced to a considerable 
extent by the needs of the population in the Arsenal district. The Center is located 
in an highly autonomous neighborhood made up of Polish, German, Slavic and 
Italian working class families. The Maslowski’s are illustrative rather than unique 
in their cohesiveness. It is not at all atypical to find three and sometimes four genera- 
tions of a family all living within a few blocks of each other. Because of this co- 
hesiveness, the acculturation process has been unusually gradual, and the community 
provides an excellent opportunity to study the integration of the old traditions with 
the new as it develops from one generation to the next. 


MeEtTHOD 


From its inception five years ago, the Arsenal Family and Children’s Center has 
emphasized research? as well as service. Over this period, a team of workers has gath- 
ered and recorded data on many families participating in the Center’s program. 
Psychologists, sociologists, psychiatrists, nursery school teachers, and group workers 
have added bit by bit to the accumulated data. Formal home visits, shared com- 
munity activities, marriage and family counseling, school contacts, structured tests 
and systematic observations are included in the records. Key members of the 
Center have gradually been accepted as participants in the lives of these families. 

For the Maslowski family there are interviews with the three sister-mothers 
over a five year period and play observations, developmental histories and psycho- 
logical reports on all of the seven cousins. The present report draws chiefly on 30 
interviews with the sister-mothers. Since the philosophy of the Center’s staff has 
always stressed the total family, all three mothers were encouraged from the be- 
ginning to discuss anything they wished. Sometimes they concentrated on the child 
attending the center but just as often they talked about significant events in their 
own lives. To organize this considerable mass of data, each of the interviews was 
divided into separate units which met the criterion of being essentially one thought - 
or idea. These units were then categorized. Some of the categories were the obvious 
ones of children, self, husband, parents, and siblings. Others such as “‘reciprocalness”’ 
were suggested by the data. In a few instances a category which emerged as signi- 
ficant for only one of the sisters provided important additional insights into the 
records of the other two. 


1A monograph on the Maslowski family is scheduled for publication in the spring of 1959. 
*Erik Erikson, as chief consultant for the Arsenal Family and Childrens Center has greatly in- 
fluenced the philosophy and focus of the Center’s research. 





EFFECT OF TRADITION ON HUSBAND CHOICE AND ROLE AS WIFE: A CASE REPORT 129 


RESULTS 


Similarities between the sisters. There are at least six ways in which the Maslowski 
sisters resemble each other: 


1. Physically, they are large women with broad faces and thick, coarse, light brown hair. Their 
body structure, facial characteristics, and hair texture, all identify them as coming from 
similar stock. 

All three sisters are described as active, energetic and productive. One might best visualize 
them as hard working farm women. There are no shrinking violets or swooning ladies in the 
group. There are also no glamour girls. In the records the housewifely qualities of these 
women come out primarily in comments about food and cleanliness. The importance of food 
and feeding is indicated by such observations as “the kitchen is somehow the light of the 
house’’ when describing the older sister’s home and “‘the kitchen seems to be the center of the 
house”’ when speaking of the second sister’s home. As far as cleanliness is concerned, all three 
sisters’ homes are described as “‘immaculate’’ and “shining with cleanliness’. 

The three sisters express similar general attitudes about the roles of men and women in their 
original family. All three regard each other as sources of help and the theme of reciprocalness 
appears over and over again in the records. Underlying this relationship of mutual support 
is evidence of strong competitive feelings between the sisters. The close juxtaposition of these 
—- and competitive feelings is suggested by an interviewer’s observation that Mary 

rubar, the oldest sister, spoke of her sister’s children with pride and affection but at the 
same time compared them as though they were siblings. Although these competitive feelings 
appear to be very strong at times, they are apparently never permitted to endanger the posi- 
tive reciprocal relationships between the sisters. Within the family they are expressed covert- 
ly and only appear in a more blatant form in external relationships. 

The father and brother ave perceived by all the sisters as receiving help rather than 
giving it. They do not participate in the reciprocally helpful interaction shared by the women. 
The male role of the elder Maslowski was characterized by hard labor outside the family and 
an expectation of being waited on and served when at hore. According to Mary Trubar when 
her father came home, if his towel was not where it should be, he would begin to shout and the 
whole family would run around hunting it. Mrs. Maslowski was given the money each pay- 
day and was expected to assume complete responsibility for the home and children. 


Socially, the focus for each of these women is the family. It is primarily within the family 


that they = ge to have their significant relationships. Sophie Dulemba, the youngest sister, 


together with her husband and two children, Michael, age six, and Joanne, age three, con- 
tinues to live in the home of the older Maslowski’s. Helen Muller, the middle sister, and her 
family which consists of her husband and three children, Christine, age nine, Carol, age five, 
and Johnny, age three, live in their own house only a few blocks away. The oldest sister, Mary 
Trubar, who’s husband deserted when she was pregnant with her youngest child, rents an 
apartment nearby where she lives with her two sons, Gary, age nine and Jimmy, age seven. 
Only the younger brother, Paul, has moved to the suburbs. The importance to these sisters 
of perpetuating the cohesiveness of the larger family is perhaps expressed most dramatically 
in the timing of their pregnancies. Every child with the exception of the sister-in-law’s most 
recent baby has a cousin of approximately the same age. The oldest, or nine-year-old pair, 
consist of Christine Muller and Gary Trubar. Michael Dulemba and Jimmy Trubar form the 
six-year-old pair. Five-year-old Carol Muller is paired with five-year-old Linda Maslowski, 
Paul’s oldest child. The youngest pair consists of the two three-year-olds, Johnny Muller and 
Joanne Dulemba. That factors other than pure chance operated in the timing of these preg- 
nancies tends to be confirmed by Mary Trubar’s statement, “In our family we have our babies 
a ye None of us were ever pregnant alone except my brother’s wife when she had her 
it baby.” 

The cohesiveness which characterized these sisters also includes their husbands’ families. 
There are repeated references in Mary Trubar’s record to her in-laws and it is apparent that 
her relationship with her husband’s people persisted long after his desertion. The following 
description of Helen Muller’s home occurs in one of the records: ‘“‘The Muller’s front yard is a 
continuation of the senior Muller’s back yard. Mr. Muller’s brother, wife and nephew live 
next door.’’ The interviewer went on to remark “relatives almost surround them.” 

Although two of the Maslowski sisters married men of different ethnic and religious back- 
grounds (Mr. Trubar is Yugoslavian and belongs to the Serbian Orthodox church and Mr. 
Muller is a German who was brought up Protestant but who was converted to Catholicism 
when he married), their husbands are similar in several important respects. Each of the men 
is a first generation American at about the same socio-economic level as the Maslowskis, 
(Mr. Muller’s father owned a tavern and Mr. Dulemba’s father was a tailor). Also all three 
husbands came from large families in which the relationship between the parents and atti- 
tudes concerning male and female roles were similar to those described above for the Mas- 
lowski family. 

In their roles as mothers, the Maslowski sisters stress education and appear to encourage 
their children to learn new ways and to surpass their parents in their socio-economic goals. 
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Whenever any of the children achieve or excel, they are reacted to with pride by the entire fam- 
ily. Disciplining typically consists of coaxing, bribing and firmness before a final frustrated re- 
sort to physical punishment. All three sisters use food together with the promise of new exper- 
iences as motivating devices. They encourage initiative and independence particularly in the 
girls as well as close family ties between the cousins. All of these mothers seek advice about 
child rearing from close female relatives rather than relying on books or experts. 


The sisters as individuals. From an early age, Mary Trubar, oldest of the sisters ap- 
pears to have adopted the role of the mother in her relationship with her sisters and 
younger brother. It is not surprising, therefore, that in her description of her hus- 
band, it requires little imagination to see the acting out of a mother-son relationship. 
According to Mary, her former husband was an only son who had been indulged and 
waited on by his mother. She is quoted as saying that if she were ever to remarry, 
she would chose a man who was “‘independent’’ and not ‘‘dependent on her like an- 
other child”. Of considerable interest along these lines is the timing of Mr. Trubar’s 
desertion. It occurred when Mary was pregnant with her second child. We can 
speculate that Mr. Trubar wanted a mother as much as Mary wanted a son and two 
siblings may have been more competition than he could tolerate. 

Mary’s conflict between her role as wife and her role as mother is suggested by 
the events leading up to her first pregnancy. In an interview with a member of the 
nursery schoo] staff Mary confessed that she went home and cried after her sister 
Helen confided to her that she was pregnant. Although Mary had been sterile for 
the preceding seven years of her marriage, she conceived only five months after her 
sister made her announcement. Mr. Trubar’s subsequent desertion suggests that 
something unconscious in Mary’s relationship with her sisters, possibly her need to 
retain the role of the maternal authority which would not have been possible without 
children of her own, apparently outweighed her need to keep her marriage intact. 
In keeping with this it is interesting that Mary might best be described as the mother 
who “knows’’. She is the authority on child care in the family. The records contain 
many references to the advice she has given both Helen and Sophie on how to bring 
up their children. 

After her husband left, Mary tried valiantly to continue in her traditional role 
as housewife and mother. For five and a half years while the children were still 
young she managed on Public Assistance although she admitted hating the idea of 
having to “accept charity’. During this period she took great pride in her home and 
was described as a magnificent cook by everyone who was fortunate enough to eat 
at her table. When it finally became necessary for her to work Mary appears to have 
tried experimenting with the culturally traditional male role that she was familiar 
with. She handed her two boys over to her mother and her pride in housekeeping 
and cooking virtually disappeared. Perhaps most interesting was her withdrawal 
from the close reciprocally helpful relationships with her sisters and her expectation 
= being ‘‘waited on’’ by them which in this family is traditionally the prerogative of 
the male. 

In a recent interview Mary confided that she would very much like to remarry. 
It seems that this woman who might be described as masculine in the way she per- 
forms her feminine role, is not too happy when she is forced to assume the masculine 
role as defined by her cultural background. Unfortunately, the man she is interested 
in has shown signs of not wanting to tie the nuptial knot. He gives as his reason that 
his mother needs him and that Gary, Mary’s oldest boy, is jealous of him. Mary ap- 
pears to be in bitter conflict as once again her children threaten to stand in the way 
of her establishing a maternal relationship with a man. 

The relationship between Helen Muller and her husband most closely resembles 
the husband-wife relationship of the older Maslowskis. Like grandfather Mas- 
lowski, Mr. Muller is a hard worker and provides adequately for his family with his 
skilled job in a steel mill. He is described by his wife as “expecting things done for 
him such as having his clothes laid out and his meals served’. Mrs. Muller also said 
that her husband was “given to quick rages’’ when they were first married but that 
he has mellowed over the years. A quick temper is also characteristic of the elder 
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Maslowski who, however, has not changed over the years. There is an even more 
significant respect in which Mr. Muller differs from his father-in-law. This is in his 
willingness to share with his wife some of the housekeeping and child rearing tasks 
which, traditionally, in both his wife’s family and his own, were handled exclusively 
by the mother. 

Helen Muller, in keeping with her personality, is the mother who seems the 
least sure of herself although again it is almost a contradiction in terms to describe 
any of these mothers as unsure or insecure. When Helen’s second daughter Carol 
was tiny she drank so much milk that Helen had to store the child’s formula in a 
half gallon wine jug. Sister Mary was consulted and immediately recommended 
cereal. When Helen hesitated because the baby was so young Mary told her to hand 
Carol over with the comment, ‘‘I’m not afraid’’. 

Sophie, the youngest of the sisters, appears to have been the most active and 
demanding of the three. The direct manner in which she still attempts to satisfy 
her dependency needs is suggested by her continuing to live with her parents even 
though she is married and has children of her own. The relationship between Sophie 
and her husband has some elements in common with the marital relationships of 
each of her older sisters. Mr. Dulemba, like Mr. Trubar, has apparently been un- 
willing to accept the full responsibility of providing for his family. Not only does he 
continue to live with his in-laws but when his skilled job in a steel mill was reduced 
to three days a week he balked at finding supplementary work. As a result, Sophie 
has been forced to assume some of the financial responsibility for the family. When 
his wife works, Mr. Dulemba takes care of the children. It is in this role that he 
bears some resemblance to Mr. Muller. However, unlike Mr. Muller who apparently 
enjoys his active participation within the home, Mr. Dulemba is quite open in ex- 
pressing his resentment over having so much household responsibility. 

In contrast with beth her sisters Sophie seems to be the least anxious mother. 


Whereas Mary appears to cover up her anxiety by ‘“‘knowing’’ and Helen worries 
openly, Sophie states quite matter-of-factly that she simply put her oldest child, 
Michael, in the garden each day and “forgot about him’’. Since Michael stands out 
in rather sharp contrast to the other cousins, this difference in approach to mothering 
between Sophie and her sisters may possibly be significant. 


DISCUSSION 

The cohesiveness of this Polish family permits us to observe, almost in slow 
motion, the changes that occur in male and female roles as the old cultural patterns 
and traditions come in contact with the new. Because of their Polish farm back- 
ground all three of the Maslowski sisters perceive the female role as an active and 
productive one. Like the generations of farm women who preceded them, they con- 
sider it only natural that they should assume the primary initiative in the manage- 
ment of their homes. They relate to each other in an highly reciprocal manner which 
provides them with additional sources of help and support in carrying out their 
many tasks as wives and mothers. Traditionally, the men in the family are respons- 
ible for its financial support. They are not expected to participate in the reciprocal 
relationships shared by the women but instead are waited on and served by them. 
This, in brief, describes the pattern of the relationship between the older Maslowskis. 
It is the pattern of male-female roles that the three Maslowski sisters were exposed 
to as children. In a Polish peasant farm community, it was probably the only pattern 
possible if the family was to survive. 

Mary, the oldest sister, appears to have incorporated the traditional patterns 
for male-female relationships to a greater extent than either of her sisters. Since her 
marriage ended in divorce we can speculate that these patterns did not prove particu- 
larly adaptive in the new cultural setting. However, when Mary was forced by cir- 
cumstances to go out and support her family, she apparently derived considerable 
ego support from her knowledge of the traditional male role. The literal way in 
which she proceeded to act out this role is extremely interesting and suggests that 
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her experimentation with the male perogative of working outside of the family 
aroused considerable anxiety. 

The relationship between Sophie, the youngest sister, and her husband suggests 
some experimentation with new ways of interpreting male-female roles. Sophie, 
unlike her sister Mary, appears to enjoy working outside of home, and Mr. Dulemba, 
albeit reluctantly, does help his wife out with some of her household and child rearing 
responsibilities. 

Helen Muller’s relationship with her husband probably represents the best 
compromise between the old patterns and the new. Of all the husbands, Mr. Muller 
most closely resembles grandfather Maslowski. Although he is similar to the elder 
Maslowski in his basic ideas about a husband’s duties and prerogatives, he differs from 
his father-in-law in his willingness to help his wife with household tasks and taking 
care of the children, something the elder Maslowski would never have considered. 
A combination of at least three factors appear to have contributed to this new inter- 
pretation of the husband’s role in the Muller family. The first stems from Mr. 
Muller’s background. Mr. Muller was one of five boys. He and his brothers, in the 
absence of a sister, were forced into the role of helpers by their overworked mother. 
Consequently, this was a role Mr. Muller was already familiar with from his own 
childhood. 

The second stems from Helen Muller’s position in her family. As the second 
girl, Helen for all practical purposes had two mothers. Although she participated in 
the cult of helpfulness with her mother and sisters, she emerges as the daughter who 
was given to and served almost as much as her younger brother, Paul. It is of con- 
siderable interest therefore that Helen married a man who was willing to perpetuate 
the role of the sister-mother toward her in its masculine equivalent of the husband- 

artner. 
5 The third factor can be found in the traditions of the American culture where 
the socially approved husband role is that of the partner. Whereas the patriarchal 
traditions of a Polish or German culture would probably have operated to reinforce 
the characteristics Mr. Muller has in common with his father-in-law and de-empha- 
size his earlier indoctrination as a helper, the forces in the American cultural tradi- 
tion have tended to operate in the opposite direction. 

In their roles as mothers all three sisters appear to be experiencing strong con- 
flicts over relinquishing the old patterns for the new. Although consciously they 
encourage their children to become educated in the new ways, the fact that they have 
worked so hard to maintain the family as a cohesive unit through the ingenious de- 
vice of the cousin pairs, suggests that there is a deep unconscious need to perpetuate 
the traditional values and customs. The evidence suggests that these three sister- 
mothers have already communicated to their children much of what they themselves 
were taught about male and female roles by their Polish farm parents and that their 
children will again be taced with the task of reinterpreting the old patterns in the 
light of the new. 


SUMMARY 

1. The records of 3 Polish sisters and their seven children were collected over a 
five year period at the Arsenal Family and Children’s Center. 

2. In the present report selective material is presented on how Polish agrarian 
tradition and changes in traditional values from contact with the American culture 
influenced these three sisters’ husband choice and interpretation of their roles as 
wives and mothers. 

3. The analytically descriptive case report approach presented in this paper 
represents the first step in testing hypotheses on how tradition influences individual 
behavior and development. 
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INTRODUCTION 


The elusive concept that psychosomatic disorders occur in individuals with 
specific psychological or physiological characteristics has proven difficult to demon- 
strate. More promising is the multi-variable hypothesis that certain psychosomatic 
disorders require (a) a physiological predisposition, (b) one of several related pat- 
terns of personality, and (c) a degree of similarity of stressful life situations. To pro- 
vide an adequate test of this complex and variably manifested prediction forces the 
investigator to an intensive, preliminary and descriptive analysis of clinical material. 
This paper presents a single case, selected from 35 cases of chronic urticaria (chronic 
hives) which have been studied intensively by a psychosomatic research team. The 
clinical study of these cases was the first step in an attempt to construct a pattern 
of causation which could then be submitted to experimental tests. The purpose of 
this paper is to demonstrate the difficulties of such an approach — the extraction of 
important variables, their translation to objective measures, and the movement 
to theory-testing experimentation. 

The personality correlates of chronic urticaria have been suggested by several 
previous studies“: *: §. 4. 5. 8), Prominent in the discussions of the associated person- 
ality characteristics were (a) marked emotional responsiveness, (b) skin erotism, (c) 
body preoccupation, and (d) exhibitionism. While the patients were overtly sub- 
missive and pleading or very aggressive, they all seemed to have strong, underlying 
hostility partially expressed in masochistic trends. The presumably important pre- 
cipitating psychological factors differ widely between studies. While most authors 
mention a generally traumatic life situation, one claims the important factor is a 
feeling of resentment over unjust treatment about which the patient feels he can do 
nothing”; others stress a feeling of unconscious anger“, sadness“), or the frustra- 
tion of intense longings for love. 


Tue CLINICAL StTupy 


The 35 chronic urticaria cases were referred because emotional problems were 
evident, no physical causation could be found, or because of our group’s known 
interest in the problem. The subjects approximate the general population, however, 
in age, education, and socio-economic level; and the approximate sex ratio of the 
disease (two women to one man). Almost every patient was studied intensively by 
each of the authors (clinical psychologist, psychiatrist, allergist, and psychiatric 
social worker). The findings from the two year, intensive, clinical study of these 
cases will be pubtished elsewhere“. The main points, highly abbreviated, are pre- 
sented below. 


Physiological predisposition. No specific evidence for an underlying allergic 
mechanism was found. Other significant medical diseases were rare in these patients 
except for obesity and psychogenic gastrointestinal symptoms. 


Personality. These patients did not present a single picture, but rather fell into 
three groups; passive-dependent, acting out, or relatively normal. Childhood exper- 
iences showed more homogeneity, but chiefly in the general, emotional tone rather 
than in actuarial specifics. Parental violence and conflict characterized the early 
home life, with shifting parental figures and rejection of the child. The child (fre- 
quently an only child, or the oldest or youngest) felt this rejection and grew up with 


1M. I. Levine, M.D., is Instructor in Medicine, Section on Clinical Immunology, Department of 
Medicine. The remaining authors are on the staff of the Staunton Clinic, Department of Psychiatry. 
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chronically frustrated dependency needs associated with resentment and depression. 
An overly controlling mother required adherence to an extremely moralistic and 
restrictive ideal role. This generated a marked tendency to chronic, inhibited but 
not repressed, hostility and masochism. Genitally, sexual impulses appeared often as 
exhibitionism, but were otherwise avoided because of the violence felt to be asso- 
ciated with sexuality. An activity defense against these various disturbing feelings 
was common. 


Precipitating situation. Hives appeared usually as the patients shifted between 
activity and passivity in a conflictful, dependency situation. This usually involved 
resentful withdrawal from key social relations with a sense of entrapment. 


Tue CasE 


The following case illustrates the difficulties encountered in specifying measur- 
able variables which adequately parallel the composite, clinical description offered. 
At the same time, the descriptive similarity of the single case to the composite pic- 
ture is apparently striking. 


Mrs. T., chosen because of the ready availability of data rather than representativeness, is 
one of the patients in the Passive-Dependent group. She is a forty year old, married woman of 
Italian extraction with three children, 17, 15, and 10. She is short, dark, plump, and congenitall 
blind in the right eye. For the past three years she has had chronic urticaria which was comers d 
ized in distribution and very annoying. There was no past family history of allergy, and medical 
examination, including routine laboratory studies, was negative. Allergy testing failed to reveal 
any specific allergic factor related to the hives. ‘‘Allergy shots’’, diet therapy, and various medica- 
tions were tried over two years with no improvement. As with most of the 35 cases, the clinical 
material throws little light on the physiological predisposition “. 

Her social history shows the early parental shifting, neglect, and the overly demanding 
mother figure. The patient’s mother died when the patient was two years old. The stepmother, 
frequently sick, was an obsessively neat housekeeper. She apparently treated the patient as a 
drudge until she died when the patient was age sixteen. The patient then kept house and was 
mother for the three half brothers. ‘‘Father didn’t confide in us’”’, she said. He favored the bo 
and yelled at her except when he wanted something. She apparently married to get out of the 
house. This material suggests the early basis for a longing for acceptance. 

The resentment and depression over frustrated dependency needs and the paaive-seupemive 
reaction to authority figures (seen as overly demanding) were shown Gury rome taking the 

ychological tests. She arrived complaining of tiredness, a headache, and of being forced by her 
Steel to come to the clinic early. After being surprised about having to do mental tasks she 
defined the situation as the examiner’s forcing her to do unpleasant things. She did the tasks only 
after protesting her inabilities then asking for much clarification, help and reassurance. Her 
posture deteriorated to the point where, thumb in mouth, she was resting her head and shoulders 
on the table while telling the TAT stories. Later, during psychotherapy, she often complained of 
the great amount of work she had to do at home, that she usually feels tired, has occasional epi- 
sodes of feeling very “‘blue”’, and that she tries to fight them by keeping active. Here the activity 
defense to depression is shown. 

About authority figures, she complained of being pestered, criticized, and of receiving no 
help. She would admit with a smile, though, that under a regime of strict control from a man who 
did some giving to her, she does what she should and looks and feels her best. In this setting she 
is also very exhibitionistic. She arrived for her psychotherapy sessions in low-necked, flowered 
dresses with white gloves and a broad brimmed white hat. ough heterosexual relations are 
accepted, they are not emotionally gratifying to her. She sees herself then as little more than 
food which is being enjoyed. The low estimate of herself is strikingly revealed in the TAT by 
references to people like herself as ‘‘it’’. 

With regard to present stresses and the onset of symptoms, the patient lives with her family 
in the home of her husband’s parents. The patient is irked by the interfering mother-in-law but 
cannot get her husband to move elsewhere. “I just have to take it’’, she says. “They are con- 
trolling everything!’’ During psychotherapy she told about various anger-provoking situations 
in her life. When asked her feelings about the situation, though, she would always reply, “‘No, I 
don’t feel angry, because there is nothing I can do.” The reactions of inhibited anger and pass- 
ivity are striking. The hives began in the summer of 1954 on the toes, then the arms, then all over 
the body except the stomach. Bouts of urticaria seemed triggered (a) before menstruation, (b) 
when there was pressure against her body, ompoteny just after taking off a tight undergarment, or 
bathing suit, (c) when arising at 6:00 a. m. to make her husband’s lunch, and (d) when feeling 
abandoned. ‘As soon as you leave me, I mean, I leave you, they come. Nobody knows how I 
suffer!’ Their appearance is usually —— by the patient’s feeling tense, jumpy, and having 
crawling sensations on the skin. The hives are most painful as they begin to appear. When fully 
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formed they seem to the patient like wounds from a fight. She admits going away by herself and 
scratching the hives (with great pleasure) till they often bleed. 


The 3 ery ee team’s appraisal and a psychoanalysts’s analysis? of the psychotherapy 
material all lead to the same formulation: The central problem seems to be anger over abandon- 
ment augmented by continual feelings of not having received enough. Guilt and fear over what 
these angers might do to others has led to their repression, at the cost of limited consciousness, 
and their projection to authority figures in a sado-masochistic relationship. She now alternates 
between two hostility-binding states: an isolated, passive, tired, discontented state; and an 
active, social state under hurtful, external control. The hives appear most often as she shifts from 
an active to an inactive state and seem to express both resentment about having been under 
pressure physically and psychologically, and about being left alone. Fears of ati , injury and 
abandonment are usually active at this time. The hives seem further activated by the anger 
Me herself (guilt) which she feels as she turns to forbidden oral gratifications (loafing, eating, 
etc.). 


At several points, of course, this case does not fit the overall findings too well. 
The early abandonment is stronger than in most cases, as is the very strong de- 
pressive affect of an anaclitic nature. Though masochism is quite clear, the fund of 
underlying rage is harder to demonstrate. The patient is certainly blocked in the 
norma! expression of hostility but the projective tests do not bring forth the expected 
torrent of underlying hostility. What has not been projected to parent figures, nor 
converted to self-punishment and low self-esteem, nor to indirect, passive-aggressive 
maneuvers, seems very well repressed. The reader can judge the goodness of fit to the 
formulations given. 


TRANSITION TO EXPERIMENTATION 


The next step is to move the formulation to an operational language that can 
then be tested experimentally. To this end a study has been planned that attempts 
to measure the hypothesized psychological and physiological predisposing factors 
and then introduces the precipitating psychological factors with the dependent 
variable being not the emergence of hives but the antecedent mechanism of peri- 
pheral vasodilation with the accompanying skin temperature rise being relatively 
greater than other physiological reactions. 

We hypothesize that non-allergic chronic urticaria patients have the following 
personality characteristics in common: (a) excessive dependency, (b) resentment 
over non-support, (c) depression over non-support and guilt, (d) marked discrep- 
ancy between ideal and actual self-concepts, (e) blocked hostility and masochism, 
and (f) more psychological maturity than most psychosomatic patients. It is felt 
that these patients develop hives when they are placed in situations where: (a) there 
are increased demands on them, (b) their dependency needs are blocked, (c) they 
fear abandonment, (d) their action tendencies are blocked, and (e) their anger is 
aroused yet expression of it is blocked. 

The problem of psychosomatic specificity requires controlled, experimental 
tests of hypotheses such as the following: (a) chronic urticaria patients are more 
characterized by these predisposing personality factors than are other groups, (b) 
the chronic urticaria patients respond to any emotional stimulation with relatively 
greater increase in skin temperature than do other groups, (c) stimulation by the 
specific kinds of stress listed above is required in order to have a hive-like skin re- 
action, and (d) there is a greater tendency to react when there is a combination of the 
factors mentioned above. In order to plan an adequate test of such hypotheses, a 
firm basis in clinical observation is essential. 


SuMMARY 
This paper has presented a case report of a chronic urticaria patient as illustra- 
tion of the findings of an intensive team study of 35 cases and has outlined an ex- 
periment planned to check systematically some of the significant causal factors. 


*The writers wish to thank Monte L. Joseph, M.D., psychoanalyst, for his contribution as super- 
visor of the counseling. 
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THE CRITICAL INCIDENT APPROACH TO THE STUDY OF 
PSYCHOPATHOLOGY* 


JOHN C. FLANAGAN AND FRED W. SCHMID 
American Institute for Research and University of Pitisburgh 


For nearly fifteen years various investigators have been collecting specific ac- 
counts of observed activities following certain rules and procedures and referring to 
them as critical incidents. Before discussing the specific application of the technique 
to the topic of this paper, a brief description of the approach will be given. The 
conditions which are set up to define a series of critical incidents are intended to 
obtain a representative, objective set of factual events all judged by the observer to 
contribute to a single aim or conclusion. Among these conditions are: 


1. The definition of the situation observed. This usually includes the delimitation 
of the incidents admitted to those involving specific persons, locations, cond- 
itions and activities. 


. The relevance to the stated aim or conclusion. This represents a judgment or 
inference on the part of the observer. Because of the explicitness of the de- 
finitions and the established criteria as to what constitutes an incident which 
has sufficient relevance and effect that it should be admitted as one piece of 
evidence regarding the stated aim or conclusion, these judgments and infer- 
ences are usually found to be relatively objective. 


. The qualifications of the observers. In order to make the type of judgments 
and inferences mentioned above, it is essential that the observers have ap- 
propriate experience and training. The nature and extent of this will, of 
course, depend on the specific situation. 


A sample of critical incidents thus defined can be treated like any other set of 
observed data to describe, to define, to compare, or to evaluate specific hypotheses. 
Two rather different uses of critical incidents will be discussed here. First, the use of 
a collection of critical incidents to describe and define specified types of behavior 
without reference to specific persons. Second, the systematic collection of incidents 
to describe the behavior over a given period of time of a single individual. 


*This study was conducted 7. Fred W. Schmid with a grant for partial support from the 


American Institute for Research. e present article was based on a dissertation submitted in Ma 
1958 in partial fulfillment of the requirements of the Ph.D. degree at the University of Pittsburgh 
entitled, A Multidimensional Analysis of the Behavior of Hospitaiized Mental Patients. 
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Tue Uss or Criticau INCIDENTS TO PROVIDE A LAY DEFINITION OF MENTAL ILLNESS 


The first study to be reported involved the collection of 854 critical incidents on 
the behavior of patients in mental hospitals. The incidents were collected in three 
hospitals. However, more than 80 per cent of them came from a single Veterans 
Hospital and they are therefore predominantly about male patients. The group had 
an average age of 34 years and an average period of hospitalization of four years. 
The incidents were primarily reported on patients in closed wards (73 per cent) and 
security wards (20 per cent), with only a few reported from privilege wards (7 per 
cent). The incidents were reported on the Patient Observational Record Forms 
shown in Figure 1. The basic questions were ‘“‘Describe an incident in which the 
patient did or said something which suggested an improvement in his condition”’, 
and “Describe an incident in which the patient did or said something which in- 
dicated his needing further hospital help.’’ About half of the incidents were reported 
by nurses and about half by nursing assistants. Examples of the incidents collected 
are given in Figure 1. 


Figure 1. SampiE Pattent OpsErRvVATIONAL RecorD Form REporTS 





Example A 
Describe an incident in which the patient did or said something which indicated his needing further 
hospital help. 
1. Tell what led up to the situation, and when and where the incident took place. 
In the hallway coming back from Building 1, about 1:30 p. m. 
2. Describe exactly what the patient did or said (be as specific as you can). 
The patient was coming back from Building 1 where he had just received chemo-therapy 
(special chemo-therapy for himself). I said to him, ‘“‘We will get your arm and legs stronger 
by doing this.’’ He said, ‘‘Not me, I can’t get stronger.” I asked why. He said, “I had a 
stroke and got paralyzed. I can’t even move my hand.” 
This is not true, the patient does move his hands. 
Why do you think the patient acted the way he did? 
The patient does not want to do anything and is using this for an excuse. 
The patient has been hospitalized for 3 years. 
(The above incident was recorded on a male patient, 47 years of age, in a closed ward. 
Diagnosis: Anxiety Reaction.) 


Example B 


Describe an incident in which the patient did or said something which indicated his needing further 
hospital help. 
1. Tell what led up to the situation, and when and where the incident took place. 
In the hallway, 10:00 a. m. 
2. Describe exactly what the patient did or said (be as epee as you can). 
The patient had been standing alone in the hallway, looking very depressed and hostile. 
Occasionally he stomped his feet in anger. Then he burst into convulsive sobs (without tears). 
I invited him to come into the day room with the other patients. He refused to do so. He 
said he couldn’t stand the hospital, and he was afraid to be in there. 
I assured him we wanted to help him. He said, “I’m not worth ap a I killed my 
brother.’”’ He remainedin the hall, very tense. Occasionally he would kick the wall. 
Why do you think the patient acted in the way he did? 
The records indicate the patient’s brother lost an eye in a hunting accident. He was not 
killed. The patient appears to feel guilty of the accident. 
The patient has been hospitalized for 3 years. 


(The above incident was recorded on a male patient, 32 years of age, in a privilege ward. 
Diagnosis: Depressive Psychotic.) 





! 

The incidents were classified into 108 categories (64 ‘‘illness” and 44 “improve- 
ment’’), each of which represented a different type of critical behavior. These were 
grouped into 14 behavior areas which in turn were arranged in three major areas 
labeled (I) Aggressive vs. Considerate Behavior, (II) Immature vs. Mature Behavior, 
and (III) Irrational vs. Rational Behavior. The main headings of the classification 
system and the frequencies in these areas are shown in Table 1. 
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Tasie 1. Criricat Benavior AREAS FOR MENTAL PATIENTS 





Signs of Illness Frequency Signs of Improvement Frequency 





I. AGGRESSIVE vs. CONSIDERATE BEHAVIOR 


. Aggression against Others 109 1. Consideration for Others 
. Aggression against Self 22 2. No behaviors of this type observed 





II. Immature vs. Mature BEHAVIOR 
. Sexual Impulses 3. No behaviors of this type observed 
. Daily Needs . Daily Needs 
. Treatment 5. . Treatment 
. Conventional Standards of Conduct . Conventional Standards of Conduct 
. Social Needs and Activities . Social Needs and Activities 
. Difficulties, Stress, and Failure ’ . Difficulties, Stress, and Failure 
. Independence and Responsibility . Independence and Responsibility 





III. InRATIONAL vs. RATIONAL BEHAVIOR 
. Loss of Consciousness 10. No behaviors of this type observed 
. Habitual Responses 11. Habitual Responses 5 
2. Preferences and Aversions 12. Preferences and Aversions 3 
. Communication 13. Communication 9 
. Understanding, Insight and Reasoning 5: 14. Understanding, Insight, and Reasoning 33 





In a check on the reliability with which classification is made, five graduate 
students in psychology were asked to classify 50 to 150 incidents each. These were 
given to them in the raw and unedited form in which they were reported. They 
placed 67 per cent in the correct one of the 108 behavior items and 77 per cent in the 
correct area of the 14 behavior groups. The students were given a brief explanation 
of the classification structure but no training. It is believed that substantially higher 
agreement could be achieved by systematic training and a carefully programmed 
practice session. 


A Srupy or THE CriticAL BEHAVIOR REcoRD 


As mentioned earlier, in addition to the use of critical incidents to describe and 
define a specified type of behavior, such incidents have been found useful in describ- 
ing the behavior of individuals over a given period of time. This study had as its 
primary purpose the experimental evaluation of the Critical Behavior Record on 
Mental Patients. The study included observations on 324 mental patients in three 
mental hospitals for two two-week periods. Graduate nurses, student nurses and 
nursing assistants observed an average of two patients each for two weeks and re- 
corded incidents which they thought indicated continued illness or signs of improve- 
ment on the patient’s Critical Behavior Record. After the initial two-week period 
the observers were given a new group of patients to keep records on for the next two 
weeks. During the four weeks 2972 critical behaviors were observed and recorded. 
Of this total, 1328 were reported as indicators of ‘‘illness” and 1644 were reported as 
indicating “improvement.” 

As a measure of the consistency of the recorded observations, the per cent of the 
behavior areas in which incidents were observed in the first period in which incidents 
were also observed in the second period was calculated. This percentage was 44 or 
just a little less than half. This consistency seems quite satisfactory, especially when 
it is noted that on the average patient, only four or five incidents were reported in 
each two-week period. 

In this preliminary study a number of special circumstances should be noted. 
First, circumstances were such that the nursing personnel could not be given ade- 
quate training in the use of the Critical Behavior Record Forms. Second, the large 
city hospital that included 165, or over half of the patients provided very atypical 
results. Only 468 incidents were reported on these patients as compared with 2504 
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on the 159 patients in the other two hospitals. Thus, in this one hospital only an 
average of 2.8 incidents were recorded on each patient in the four-week period. In 
the other two hospitals an average of 15.7 incidents was recorded on each patient 
during the four weeks. Third, whereas in the initial critical incident study most of 
the reports were on patients in closed and security wards, in the Critical Behavior 
Record Study, most of the patients observed were in privilege wards. 


SuMMARY AND CONCLUSIONS 


Two exploratory studies have demonstrated the value of the critical incident 
approach to the study of psychopathology. With appropriate training and support 
from hospital management, relatively objective reports of certain types of patient 
behavior can be obtained with the expenditure of only a few minutes recording time 
for each patient each week. It is proposed that the Critical Behavior Record Form 
be revised on the basis of the experience in this study and be studied further using 
large samples of patients observed over longer periods of time. 


IMPROVING THE MEASUREMENT OF CLINICAL PERFORMANCE 
OF MEDICAL STUDENTS* 


JOHN T. COWLES AND ALBERT J. KUBANY 
University of Pittsburgh School of Medicine General Motors Institute 


PROBLEM 

There has been a growing realization by teachers and administrators of medical 
schools that we must improve our measurement of the more significant aspects of 
student performance before we can hope to improve our selection of students“). 
Stated simply, we need more reliable and professionally relevant criteria for validat- 
ing selection techniques. This paper reports an early phase of research directed to- 
ward the identification and objective measurement of the principal aspects of the 
clinical performance of medical students. 


Most of the correlational studies of the efficacy of various selection measures for medical 
students have used the available grades in medical school as criteria“). For various reasons, 
these studies have relied more heavily on measures of performance in the first two, or preclinical, 
years devoted to the basic medical sciences. One reason is the earlier maturation of criterion data 
from those two years; another is the fact that most of the failures and other withdrawals occur 
during the first two years. A further, less obvious reason is that the course grades or annual 
averages of the first two years ty pically contribute more than do the grades of the last two, clinical 
years to the final, four-year average or ranking of graduating medical students. This is due to the 
larger standard ‘deviations and consistently higher intercorrelations of principal grades and 
averages in the first two years. Therefore, even when the four successive annual averages are 
given equal numeric weight in determining four-year standing, the averages of the first two years 
will contribute much more heavily to the student’s four-year average. Formal written examina- 
tions and relatively objective grading exercises are common during the first two years when the 
student’s performance is measured largely in academic lecture; laboratory, and reading situations. 
However, in the last two years grading takes place in the much more ‘complex situations of his 
hospital and clinic clerkships; grading is done in a more subjective manner, usually without the 
aid of well-defined instruments or criteria. 

When validity coefficients have been computed between selection measures and separate 
subject grades, it appears that the pre-admission measures of academic proficiency or aptitude for 
advanced study in science, such as the undergraduate college average or the Medical College Ad- 
*This work, done at the University of Pittsburgh, was supported partly by a grant to that Uni- 

versity and to L oyola University (Chicago) from the Commonwealth Fund of New York and partly 
by a research fellowship to the junior author from the U. 8. Public Health Service. 
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mission Test, may predict medical school grades as well as any available selection test or tech- 
nique. Some predictions of overall medical school performance, including dropout for various 
causes, from combinations of these narrowly academic type of measures have been consistently 
high, but at best they still leave 50 percent or more of the variance in medical school grade aver- 
ages unp edicted. A fair amount of this unpredicted portion may, of course, be error variance. 

From data on the four most recent classes graduating from the University of Pittsburgh 
School of Medicine, mult‘ple-R’s ranging from .47 to .73 for separate samples, and an average R of 
.64 for the four samples, were obtained in predicting final school status including rank in class or 
dropout for any cause. These R’s were based solely on the following variables: birth year, high 
school rank, college average, and the four subtests of the Medical College Admission Test. 
sults of cross-validation of the several sets of regression weights demonstrate that similar pre- 
dictive efficacy of these admissions measures could be relied upon for subsequent classes. 

Various studies have shown that we cannot easily improve upon these predictions based on 
fairly standardized measures of ac:.demic aptitudes. Supplementary personality measures have been 
tried, such as interview data, inventories of motivation, interest, or adjustment, and even some 
projective devices originally designed for psychiatric screening and diagnosis. Despite much effort 
with promising personality measures, their yield beyond the usual academic predictors has been 
disappointing indeed. It has become increasingly clear that the medical school evaluations of 
students must first be improved, particularly to obtain measures of physician-like performance in 
the hospital wards or out-patient clinic, rather than of academic performance in classroom and 
laboratory. 

Appropriate situations for professionally-relevant and realistic measures occur in the clinical 
work of the student’s third and fourth years. It is here, especially in the department of medicine, 
that the student has extensive contact with patients and their medical problems. Here, too, exists 
the opportunity for close observation of the student in small preceptorial groups. And in the 
fourth year the student assumes greater professional responsibility, under the preceptor’s guid- 
ance, for the patient’s interview, examination, diagnosis, and management. The scheduled hours 
for each student in his work in the department of medicine alone may total several hundred in 
each of these two years. 

While extensive sociological studies by interview and questionnaire have been carried out by 
others, to determine the attitudes of medical students during their clinical years toward patients, 
nurses, doctors, and the medical profession “), it was the writers’ conviction that much more is to 
be gained toward obtaining objective and reliable criterion measures of student performance if 
they are more directly based on what the student does—what others directly observe of him— 
rather than on what he says or claims to believe in interview or questionnaire reports. 


PROCEDURE 
Twelve experienced faculty members were therefore interviewed at length in the 
fall of 1955 to determine from them, as experts, what were the most important 
characteristics for a student to possess or acquire in preparing for general practice, 
and secondly which of these characteristics were most easily observed in the teaching 
situations of the third and fourth years. The following reduced list of characteristics 
resulted from an analysis of the tabulation of these answers, and with certain phrase- 
ologies adapted from Diederich’s study of critical incidents reported by practicing 
physicians ®?: ; 
1. Knowledge of medical information. 
2. Ability to gain and maintain patients’ confidence. 
3. Assumption of responsibility for, and insight into patient’s total problem: medical, 
social and emotional. 
4. Skill in observing, recording, and reporting. 
5. Skill in developing and verifying hypotheses from patient data. 
6. Stability under difficult situations. 
7. Integrity: honesty, ethics, recognition of own limitations. 
8. Interest in profession and in self-improvement. 


Interestingly, very similar lists of important characteristics have been independently 
obtained in two other criterion studies carried out at the same time as the present 
one, “: 1°) 

Two approaches toward the measurement of these characteristics were then 
undertaken. One was the use of Kubany’s“? experimental sociometric scale, based 
on students’ nominating for each of the characteristics the three students from among 
their classmates whom they judge to possess the greatest degree of the particular 
characteristic. Another approach, having the advantage of being suitable for ad- 
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ministrative use in the grading of students by faculty, was the observational rating 
form called the “Clinical Performance Rczord’”’ (CPR),! consisting of brief defini- 
tions of the eight characteristics, with appropriate spaces for rating the student’s 
position in his class as top 1/5, middle 3/5, or lowest 1/5, or ““Did Not Observe.” To 
derive a rating score, whether for one characteristic or for all 8 characteristics, a 
“Top 1/5,” ‘“Middle 3/5,”’ or “Lowest 1/5” was counted as a 3, 2, or 1 respectively. 
In addition, several lines of space were provided for each characteristic to permit the 
rater to describe with written comments the behavior of the student which occasioned 
his particular rating of the student. Detailed instructions for using the CPR were 
also furnished each faculty member, including specimen comments describing 
effective or ineffective behavior for each characteristic. 

The data to be reported here were obtained from CPR’s for 41 fourth-year 
students, each student being rated independently by an average of 4 preceptors in 
the departments of pediatrics and medicine at the University of Pittsburgh School of 
Medicine. Ratings were accomplished as each student completed his service with 
that department in the spring of 1956. 


RESULTS 

Preliminary analysis of the quantitative ratings on each characteristic showed 
that the number of ‘‘Did Not Observe’’ checks was small, except for characteristic 
No. 6 (Stability), and corresponded generally with previous faculty judgments as 
to the relative difficulty of judging the 8 characteristics. The “Stability” item was 
therefore eliminated in a subsequent revision of the CPR. Distribution of the 1268 
ratings revealed an upward bias of ratings toward the more favorable end of the 
scale, namely, 31% in the “‘Top 1/5’, 64% in the “‘Middle 3 /5’’, and only 5% in the 
“Lowest 1/5” of the class. It should be pointed out that each preceptor rated only a 
very few students, and this was his first attempt at making such ratings. However, 
this upward bias of the ratings has been observed in ratings by experienced pre- 
ceptor-raters after two subsequent years of such ratings. Total CPR rating scores 
were also obtained for each student by totalling and averaging all ratings assigned 
on all characteristics by his several preceptors. 

The reliability of these average ratings for each student on each characteristic, 
as well as reliability of total CPR scores, was estimated by an analysis of variance 
technique which took into account the varying number of raters. Product-moment 
intercorrelations of the average ratings on each characteristic were also computed. 
The intercorrelations and reliability estimates are presented in Table 1. 


TABLE 1. INTERCORRELATIONS AND RELIABILITIES OF AVERAGE RatiInGs ON THE CLINICAL 
PERFORMANCE Recorp (N = 41)* 





Character- Intercorrelations Relia- 
istic 1 2 3 4 5 bility 


Medical knowledge .3l .38 .58 P ‘ i : — 
Gain pt’s. confidence .49 14 ‘ 7 i ‘ .36** 
Responsib. for pt’s. tot. prob. 41 : : am .57 -.10 
Observing & recording ; : : : .49*** 
Devel. & verif. hypotheses ; : 5 .69*** 
Stability in diffic. sits. , : .49** 
Integrity ‘ -.03 
Interest in profession 22 
Total Score .58*** 


* Except for the intercorrelations of characteristic number 6, where N = 40, and for the reliability 
coefficients of numbers 5 to 8, where N ranges from 31 to 39. 
** Significantly different from zero at the 5% level of confidence. 
*** Significantly different from zero at the 1% level of confidence. 





aSNOo rR Whe 





1A specimen copy of the CPR may be obtained from Dr. John T. Cowles, School of Medicine, 
University of Pittsburgh, Pittsburgh 13, Penna. 
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The reliability of average ratings of characteristics ranged from essentially zero 
to +.69, with five of the eight characteristics having coefficients significant at the 
5% level, and the total scores having a reliability coefficient of .58, significant at the 
1% level. These coefficients of reliability compare favorably with similarly derived 
coefficients from other rating scales of complex behavior. Logically, the more reliable 
measures concern the more objectively defined and observable items, namely, 
knowledge of medicine, observing and recording, and diagnosis and verification. The 
items with least reliability were among those earlier judged most difficult to observe, 
namely, responsibility and integrity. The intercorrelations of these average ratings 
show a higher positive relation among these same characteristics which are measured 
more reliably, namely, knowledge, observing and recording, and diagnostic skill. The 
generally low coefficients suggest relative independence among the ratings of various 
characteristics, but this is to be verified with a much larger sample. It is possible that 
some selection of evaluators, choosing only those who are the more consistent raters, 
will increase the reliability of ratings, and likewise the magnitude of intercorrelations. 

Further evidence of the promising nature of these CPR ratings was seen in the 
correlation of total ratings with major subject grades and averages of the fourth 
year, as well as with four-year averages. These data are presented in detail by Ku- 
bany“’. In general, the correlations with the major subject grades and averages of 
the fourth year were positive and significant at the 5% level or better, including 
grades in medicine, pediatrics, psychiatry, fourth year average, and four-year aver- 
age. Correlations with two other subjects, surgery and obstetrics, were insignificant; 
however, these same subjects also showed very low correlations with the major sub- 
jects named above. 

Review of the written comments which were submitted on these CPR’s showed 
that they were less frequent than might be hoped. Only 195 comments accompanied 
the 452 ratings as ‘“Top 1/5” or ‘““Lowest 1/5” of the class. This may be due to the 
initial inexperience of the faculty with such a procedure; a number of preceptors have 
stated that they do not observe their students over a sufficient period of time to feel 
sure of their evaluations. Another reason is that most of the physician-preceptors 
were part-time faculty, giving their time voluntarily from a busy practice, and were 
unable or reluctant to spend additional time for the grading process. The comments 
ranged widely from very specific descriptions of critical situations to gross general- 
izations, such as “average student in most respects’. Many interesting dimensions 
or traits of behavior were mentioned which were barely suggested by the captions on 
the CPR. Some preceptors made frequent comments and some made no comments 
whatever. Obviously, the success of the descriptive evaluations depends on full co- 
operation and communication by all preceptors. Because of the diversity of com- 
ments and relatively small total number of them, no categorization or other detailed 
analysis was attempted at this time. Representative comments are reproduced, 
together with some additional CPR data, elsewhere“. 


SUMMARY 


This paper reports encouraging results from the introduction of a new medical 
student rating instrument, which appears to have some reliability and some con- 
current validity in relation to major subjects of the fourth year. The Clinical Per- 
formance Record (CPR) has now been continued as a regular evaluation device for 
all third year and fourth year students in the department of medicine. Several 
hundred comments have been collected during two successive academic years. The 
frequency of comments has increased as preceptors have become more familiar with 
the evaluation process, and as departmental staff recognize the value of the sum- 
marized comments for providing a diagnostic, verbal description of strengths and 
weaknesses of each student. Certain preceptors are providing very comprehsensive 
appraisals of all their students. A detailed content analysis and psychometric scaling 
of these obtained comments is now under way. As an extension of this project, 
specific areas of assessment such as “Diagnostic Skill’ are being further objectified 
by Rimoldi® and others, through the development of separate testing devices. 
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A SUGGESTED INSTRUMENT FOR EVALUATING 
SPEECH THERAPY WITH CEREBRAL PALSIED ADULTS* 


JACK MATTHEWS AND ERNEST J. BURGI 
University of Pittsburgh University of Nebraska 


INTRODUCTION 

The effectiveness of speech therapy can be evaluated in part by improvement in 
the understandability of the person receiving speech therapy. This is not to deny 
the importance of such factors as improvement in general adjustment as well as 
improvement in voice quality, speech rhythm, volume and other aspects of speech. 
In a series of studies at the University of Pittsburgh attempts have been made to 
describe improvement shown during speech therapy by comparing pre-and post- 
therapy measures of the intelligibility of the speakers undergoing therapy and to de- 
termine important correlates of speech intelligibility in order to devise simple clini- 
cally useful measures of intelligibility. Dietze® found that a measure of consonant 
articulation ability (weighted for the frequency of occurrence of the various con- 
sonants in English speech) showed substantial correlation with the intelligibility of 
the speech of a group of elementary school children with functional articulation dis- 
orders. McWilliams *? had each of 48 cleft palate subjects record on tape one of 12 
lists of words, phrases and short sentences phonetically balanced in accordance with 
the incidence of consonant sounds found in conversation by Travis". Each list 
was designed to contain the 23 major consonant sounds in proportions comparable 
to the appearance of these sounds in conversation. A weighted articulation score was 
obtained from an analysis of articulation errors made by each speaker. An intelligi- 
bility score was derived based on the percentage of recorded words which could be 
understood by a group of independent judges. McWilliams found a correlation of .72 
between the weighted articulation index and the independently derived measure of 
intelligibility. 

*This study was made possible in part by a grant from United Cerebral Palsy Association of the 
Pittsburgh District. 
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The present study continues to explore relationships between measures of 
articulation and measures of intelligibility—in this instance in a population of cere- 
bral palsied patients with speech problems. This investigation is also concerned 
with developing improved measures of intelligibility and simpler, more-easily-em- 
ployed-in-the-clinic measures of articulation. 


METHODS 


Intelligibility Measures (Criteria). Intelligibility scores are influenced by the type of 
unit of speech being evaluated. When speech units larger than a sentence are em- 
ployed the ratings of these larger units become quite subjective. When speech 
units shorter than words are employed it becomes difficult to obtain a sufficient 
range of values of intelligibility scores, 7.e., with short speech units such as a single 
sound it becomes difficult to obtain high intelligibility values. A more complete dis- 
cussion of these methodological problems can be found in Egan. 

. The present study employed two measures of intelligibility. The first consisted 
of the Harvard PB Lists®’ employing phonetically balanced words and the second 
was the Harvard SI Lists“ employing sentences. Both of these measures have been 
widely used in assessing discrimination loss in hard-of-hearing individuals. The 
lists, when spoken by normal speakers, have served as standardized stimuli to pre- 
sent to individuals with hearing loss. The subject with a hearing loss records what 
he hears and this becomes the raw data for determining a measure of his sound dis- 
crimination. In the present study the Harvard lists were spoken by the cerebral 
palsied subjects. Their recorded speaking of these lists served as stimuli to present to 
judges with normal hearing. These judges wrote what they heard. The reports of 
these judges became the raw data for determining a measure of the intelligibility of 
the cerebral palsied speakers. We have utilized word and sentence lists to evaluate 
the intelligibility of speakers whereas intelligibility lists previously have been used to 
assess equipment (hearing aids, transmission lines, etc.) from the standpoint of its 
ability to transmit intelligible speech. 


Articulation Measures (Predictors). Measures of intelligibility require more time and 
equipment than most clinical settings can afford. What is needed is a simple instru- 
ment that is economical of time and equipment but which is highly correlated with 
the more time consuming intelligibility measures. The predictors used in this study 
consisted of several measures of articulation skill. Each subject produced each of 24 
consonant sounds in each position in which the sound occurs in English. These 
sounds consisted of [m], [p], [b], [w], [f], [v], lel, [3], th], [t§], (n], ft], (d], [rl], [s], [2], 
(i), {), (8), [3], [43], fp], {k], [g]. In addition the following 13 consonant blends were 
tested in the initial positions of words: [br], [dr], [tr], [kr], [gr], [pl], [kl], [st], [sp], 
[sk], [sm], [sn],, and [sl]. The following 9 consonant blends were tested in the final 
position of words: [ts], [ns], [It], [Id], [Ik], [rk], [rn], {rt], and [rd]. The results of this 
articulation test were scored in a variety of ways—each resulting in a ‘“‘predictor”’ of 
intelligibility. Two of the predictors were based on a relatively complex analysis of 
the articulation test. The first of the complex predictors will be called the Wood 
Index. It is a measure weighted with respect to the frequency of occurence in the 
English language of each consonant sound. The second complex predictor will be 
referred to as the Mader Index? and is based on data dealing with frequency of 
occurence of sounds in each word position in transcribed speech. 

Six simple predictors were derived from a less complex analysis of the articula- 
tion test. The first simple predictor consisted of a count of the number of correctly: 
produced consonants and consonant blends. The second consisted of a count of 
correctly produced single consonants (no blends were included). The third was a 
count of correctly produced initial consonants. The fourth was a count of correctly 
produced medial consonants. The fifth predictor was a count of correctly produced 
final consonants. The sixth measure will be referred to as the Irwin Index“) and 
consisted of a count of the number of correct productions of five front consonants, 
[p}, [b], [m], (d], and [t). 
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PROCEDURE 


Speech samples were obtained from 17 male and 12 female cerebral palsied sub- 
jects ranging in age from 17 to 49 years. A more complete description of recording 
procedures, speech samples, etc. can be found in Burgi and Matthews. All speech 
samples were recorded on tape for further analysis. Following a period of training 
five graduate students in speech and hearing disorders listened to the taped articula- 
tion tests and independently judged the correctness of production of each consonant 
and consonant blend in the tape-recorded articulation test. 

A separate group of nine judges was employed to secure measures of the in- 
telligibility of the cerebral palsied speakers. These judges were graduate students in 
education whose hearing and listening ability previously had been determined to be 
satisfactory. These judges wrote what they were able to understand of the record- 
ings made by the cerebral palsied speakers of the PB and SI lists. The responses of 
each judge were scored correct if the word written by the judge was the same or 
phonetically equivalent to the key word in the origina! PB or SI list. The intellig- 
ibility score of each subject was determined by averaging the scores given that sub- 
ject by each of the nine judges. The reliability of all measures was determined and 
found to be satisfactory. (Percentages of agreement among judges ranged from 80% 
to 91% and average intercorrelations among judges were in the high .90’s.) 


RESULTS 
Table 1 summarizes the major findings of the present investigation. All of the 
reported correlations are significantly different from zero. Table 1 indicates that all 
of the measures of articulation skill employed in this study were highly correlated 
TaBLe I. Pearson r’s OBTAINED FOR EvaLuaTING RELATIONSHIPS BETWEEN 


Prepictors (ARTICULATION MEASURES) AND MEASURES OF INTELLIGIBILITY 
(PB anv SI Scorsgs) 





Intelligibility Measures 
Predictors PB 


Wood Index 91 
Mader Index .88 
Count of Consonants and Blends .90 
Count of Consonants 91 
Count of Initial Consonants .86 
Count of Medial Consonants .90 
Count of Final Consonants .88 
Irwin Index .68 








with measures of word and sentence intelligibility. This is consistent with previous 
research carried out at the University of Pittsburgh in populations of cleft palate 
and of functional articulation cases. Of equal interest is the finding that with the 
exception of the Irwin Index most of the shorter and more clinically feasible measures 
of articulation correlate just as highly with intelligibility measures as do the more 
complicated measures such as the Wood or the Mader Index. 


CoNCLUSIONS 
This research suggests that satisfactory measures of intelligibility of the speech 
of cerebral palsied subjects can be obtained from a relatively simple procedure which 
consists of counting the number of consonant sounds the speaker can correctly pro- 
duce. Such measures can be obtained as a rule in a matter of several minutes. Often 
data for such measurements may already be available in the recordings of speech 
therapy sessions. Periodic counts of articulation skill might very well be employed 
as one index of the effectiveness of speech therapy with cerebral palsied adults. This 
technique shows promise for use in evaluating the effectiveness of speech therapy 
with clinical groups other than cerebral palsied. 
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NEEDS, VALUES, PERCEPTIONS AND THE NURSE-PATIENT 
RELATIONSHIP* 


J. FRANK WHITING 
Veterans Administration Hospital, Leech Farm Road, Pittsburgh, Pa. 


A growing body of evidence supports the view that personal needs, values, per- 
ceptions and overt behavior in inter-personal relationships are very intimately 
linked together. In general, the inter-relationships of these concepts and behavior 
phenomena can be formulated as follows: The well-springs from which human be- 
havior derives are personal needs and the satisfaction thereof. As a result of the 
particular individual’s history of experience with need-satisfaction, he comes to 
value certain behavior in himself and others. This valuing process leads to the estab- 
lishment of certain more or less stable response mechanisms which determine the 
way in which the individual perceives particular acts which he and others perform. 
These perceptions, in turn, are the determinants of behavior in the everyday life 
situation. This article presents illustrative data from a current research program 
3, 4, 5) in the Pittsburgh area studying the relationships between personal needs, 
values and perceptions as observed in nurse-patient relationships in a modern hos- 
pital. 


MetuHop or Data COLLECTION 


Our data were collected with a specific Q-sort technique, developed for the 
special purposes of these studies and described elsewhere“). Each subject, e.g., 
patient, nurse, physician, etc., was asked to sort the set of 100 statements written on 
individual 3” x 5” cards into nine piles, representing points on a continuum of relative 
importance. The specific instructions were: 


“These days, nurses are called upon to do a lot of things with patients. All of these activities 
are worthwhile and important, but all of us have a limit to our time and energy. Our study is one 
in which we want to measure what various people—nurses, patients, doctors, and others—feel are 
more important and less important activities for the nurse to carry out in giving patient care. 
Your job will be one of sorting 100 cards with statements written on them about things nurses do 
in caring for patients. While you are sorting the cards, you should keep the following in mind: 
‘Which of these activities do you feel are of high importance, of medium importance, of low im- 


*We are indebted to the staffs of the SRovignepiiel in which the data presented in this article 


was collected: VA Hospital, Leech Farm Road, Pittsburgh, Pa., Western Psychiatric Institute and 
— Pittsburgh, Pa., VA Hospital, Rutland Heights, Mass. and Pittsburgh Hospital, Pittsburgh, 
‘a. 
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portance in the nurse’s job in caring for patients on a treatment ward in a hospital?’ (The 
appropriate term describing the type of setting in which the subject was involved, e.g., tubercu- 
losis, psychiatric, general medical and surgical, was inserted into the blank space.)’”’ 


The end product of the Q-sort procedure was the 100 behavioral statements 
separated into the following distribution: 


Most relative importance Least relative importance 


Pile Number 1 2 3 4 5 6 7 8 9 
Number of statements 1 4 11 21 26 21 1l 4 1 


MEDICATION AND THE NuURSE-PATIENT RELATIONSHIP 

Traditionally, a focal point of contact in nurses’ and patients’ inter-relation- 
ships has been the nurse’s administration of medication to the patient and her evalua- 
tion of the patient’s reaction to the effects of medication. There are seven items in 
our Q-sort stating behaviors the nurse carries out in relation to the medication aspect 
of patient care. These items are presented in Table 1, together with mean ratings 
of the relative importance of the behaviors as perceived by a group of 70 registered 
nurses and 125 patients in a tuberculosis hospital. 


TABLE 1. CoMPARISON OF NURSES’ AND PATIENTS’ PERCEPTION OF MEDICATION IN A TUBERCULOSIS 
Setrine (Patient N = 125, Recisterep Nurse N = 70) 





Group Per- 
ceiving item 
Mean Mean as relatively 
Behavioral Statement RN PT more important 





The nurse watches the patient for any toxic 

symptoms following the administration of 

medication. 3.07 3.60 <.01 RN 
The nurse gives the patient in pain pre- 

scribed medication. 4.11 3.70 <.01 PT 
The nurse refers to the doctor the patient 

who will not take his medicine. 4.39 4.01 <.01 PT 
The nurse gives prescribed medication to 

the patient who is unable to sleep. 4.76 4.15 <.01 PT 
The nurse explains to the patient why he 

needs to take his medication. 4.26 5.10 <.01 RN 
The nurse stays with the patient until he 

has taken his medication. 4.47 5.79 <.01 RN 
The nurse is understanding when the pa- 

tient refuses his medication. §.21 5.62 <.01 RN 





There are significant differences between the levels of relative importance with 
which each of the seven items were perceived by the patients, on the one hand, and 
the nurses, on the other. Thus, in this tuberculosis setting at least, medication, as it 
relates to patient care in hospitals, is a vital issue concerning which different people 
make quite different value judgements. More specifically, if we look at the content 
of the items emphasized by nurses and patients, we note an essential difference be- 
tween the nurses’ and the patients’ views concerning patient care which has been 
corroborated by our other findings not discussed in this paper. The nurses empha- 
sized items involving (a) observation of toxic symptoms following medication; (b) 
the explanation of the need for medication; (c) staying with the patient until he 
takes the medication; and (d) being understanding when a patient retuses medica- 
tion. These activities represent at once a synthesis and a cross-section of the nurse 
as a professional person integrating the use of physical agents with the use of herself 
as a psychological agent to promote healing, and satisfying the need to see herself as a 
helping person who competently uses her professional skills to facilitate healing. 

The patients emphasized (a) administration of medication for the relief of pain; 
(b) administration of medication for the relief of insomnia; and (c) the reporting to 
the physician of a patient who does not cooperate in taking his medication. These 





148 J. FRANK WHITING 


activities, taken together, imply a much more circumscribed role for the nurse vis-a- 
vis the patient. Nevertheless, each of these activities basically involve the patient’s 
need to ameliorate noxious physical and psychological experiences, 7.e., in relation to 
pain and insomnia. When a patient refuses to take medication, he usually expresses 
fear, hostility or other disturbing feelings which can have an unfortunate effect on 
his fellow patients, thus the patient’s desire to ameliorate noxious experiences is 
also involved. 

This data, in conjunction with other findings, indicate that the central factors 
which make understandable all of the values and perceptions of behavior within the 
nurse-patient relationship, are the immediadely experienced needs of the nurse and 
of the patient in relation to healing, and the nature of the setting in which healing 
takes place. Since the nature of the setting is held constant in relation to the above 
data, the differences between the nurses’ needs and the patients’ needs becomes the 
decisive factor shown in the above data. 


Tue INFLUENCE OF THE NATURE OF THE SETTING AND THE TYPE OF PERCEIVER ON 
VALUES AND PERCEPTIONS OF THE NURSE-PATIENT RELATIONSHIP 


We have noted above that the central factors influencing the perceptions and 
values relating to the nurse-patient relationship are the nature of the setting and the 
immediately experienced needs of the people in the setting. We now present a cor- 
relational matrix demonstrating the influence of both of these general variables on 
perceptions of the nurse-patient relationship. 

The correlations in Table 2 are derived from a sampling of the correlations cal- 
culated on the Q-sort data. In order to obtain an array of scores for calculation of 
the correlation existing between the perceptions of two groups of individuals (regist- 
ered nurses and patients in the tuberculosis setting), we summated the ratings given 
to each of the 100 items in the Q-sort by each of the individuals in the patient and the 
registered nurse groups. This procedure gave a set of 100 item-sums for the patients 
and a set of 100 item-sums for the nurses. We then calculated inter-correlations be- 
tween these sets of 100 sums for each pair of the experimental groups by means of 
the Pearson r method. Thus, a high correlation indicates a marked similarity in the 
level of relative importance assigned to particular behaviors, while a low correlation 
indicates a marked dissimilarity in the level of relative importance assigned to the 
particular behaviors by the particular groups correlated. One further point needs to 
be made in interpreting the data in Table 2. The reader will recall that each of the 


TaBLe 2. CoRRELATION Matrrix* 








TB Setting Gen. Med. Setting Psychiatric Setting 
Groups RNs NAs PTs Frs Jrs Srs VARNs WPIRNs WPINEd WPIMD 





Tuberculosis RNs 
Setting NAs .73 
PTs ; 


Gen. Med. Frs : : 
Setting (st. Jrs . ‘ : .88 
nurses) Srs q a ‘ .83 


VARNs d 2: -30 
Psychiatric WPIRNs 48. .- oe oe ; -89 
tting WPINEd.| . ‘ : a ‘ -76 81 
WPI. on : ‘ q 25. ‘ -76 -80 59 








*Tuberculosis Setting: RNs—Registered Nurses, N = 70; NAs—Nursing Assistants, N = 82; 
PTs—Patients, N = 125. General Medical Setting (student nurses): Frs—Freshmen, N = 36; Jrs.— 
Juniors, N = 41; Srs—Seniors, N = 35. Psychiatric — | VARNs—Veterans Administration 
Hospital, registered nurses, N = 86; WPIRNs—Western Psychiatric Institute, Nursing Service, 
registered nurses, N = 38; WPINEd—Western Psychiatric Institute, Nursing Education, N = 9; 
WPIMD—Western Psychiatric Institute, Psychiatric Residents, N = 31. 
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subjects was asked to sort the behavioral statements in terms of his judgment as to 
their importance in the nurse’s job in caring for patients in the particular setting with 
which the subject was involved. Therefore, the inter-correlations between groups in 
different settings, e.g., nurses in a psychiatric setting and nurses in the tuberculosis 
setting, indicate the extent to which these different groups use a common value 
system in perceiving the nurse-patient relationship in the two different settings. The 
same point applies to all of the inter-correlations between groups who are involved 
with different settings. 

There are several interesting points indicated by the data in Table 2. First, 
each of the inter-correlations between groups within each of the settings, e.g., within 
the psychiatric setting, is substantial. The highest correlation is between the register- 
ed nurse groups in two different psychiatric hospitals (.89) ; the lowest correlation is 
between psychiatric residents and nursing educators (.59). Since the lowest inter- 
correlation implies a significant degree of shared variance (35%), we can conclude 
that within a given setting in which the healing takes place, there does exist a sub- 
stantial core of values which are influencing the perception of the nurse’s behavior in 
the nurse-patient relationship. 


Second, granting the existence of this common core of values influencing per- 
ceptions within a given setting, the amount of sharing of values is quite definitely 
related to the background of the different groups in the given setting. Thus, in the 
psychiatric setting, we find a shared variance of 79% between two groups of register- 
ed nurses in two different psychiatric hospitals who not only have a good deal in 
common with respect to training, educational background, etc., but who face very 
similar problems in patient care. The same point applies to the inter-correlations 
between the student nurse groups within the general medical and surgical setting. 
In contrast, when we compare two diverse groups who not only have different per- 
sonal backgrounds, but also face different types of problems within their day-to-day 
life situations, we find a marked diminution in the amount of shared variance in- 
dicated by the inter-correlations existing between these latter groups, e.g., psychiat- 
ric residents as compared with psychiatric nursing educators (35%), or registered 
nurses in the TB setting as compared with patients in the TB setting (39%). 


Next, if one examines the inter-correlations between groups, each of which are 
involved with a different type of setting, some further marked contrasts in shared 
variance can be observed. These contrasts are most evident in the inter-correlations 
between the various levels of student nurses in the general hospital setting, on the 
one hand, and the groups in the tuberculosis and psychiatric settings, on the other. 
The correlations between the perceptions of the registered nurses in the tuberculosis 
setting and the different levels of nursing students are quite high (.78 - .82), with no 
essential difference existing between the inter-correlations of the TB registered 
nurses and beginning students, on the one hand, and TB registered nurses and grad- 
uating students, on the other. Thus, the experienced nurses and the student nurses, 
irrespective of level, share a considerable common core of values in their perception 
of the nurse-patient relationship. Secondly, the correlations between the different 
levels of student nurses and the patients and nursing assistants in the TB setting are 
also quite high (71% for nursing assistants and freshmen nursing students, and 55% 
for patients and freshmen nursing students), while there is a marked diminution of 
shared variance between the graduating nursing students and the two TB groups 
(36% for nursing assistants and senior nursing students, and 33% for patients and 
senior nursing students). Thirdly, if one examines the inter-correlations between 
the different levels of student nurses in the general hospital setting, and the various 
groups in the psychiatric setting, a change of perception in the opposite direction 
can be observed. The reader will note that the correlations between each of the 
groups in the psychiatric setting and the freshmen nursing students is quite low. 
However, the correlations between each of the groups in the psychiatric setting and 
the senior nursing students is substantially larger. 
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Now, if one integrates the three results cited above, namely, (a) the high and 
stable correlation between experienced nurses in a TB setting and all levels of student 
nurses; (b) the diminution of similarity of perception between freshmen and senior 
students, on the one hand, and nursing assistants and patients in the TB setting, on 
the other; and (c) the increase in similarity of perception between freshmen and 
senior students, and various groups of nurses and physicians in the psychiatric set- 
ting, it is clear that as professionalization of the students through nursing education 
proceeds, their perceptions change and, by inference, their values. It follows, then, 
that crucial differences in behavior are occurring which can have a marked effect on 
the degree of communication and understanding existing between and among these 
several groups. Limitations of space do not permit the analysis of the content of be- 
haviors which underlie these shifts of perception, and which account for increases 
and decreases in the similarity of points of view held by the different groups. Such 
an analysis is crucial to an understanding of how the similarities and differences of 
values and perceptions affect patient care in the modern hospital. 


SUMMARY 


This paper reports illustrative data from a research program dealing with the 
relationships between personal needs, values and perceptions as they influence a 
particular inter-personal relationship in the modern hospital. The current research 
is empirical, and is hypothesis-generating rather than hypothesis-testing. The 
particular empirical question which guides the work at this stage is: “WHO feels 
WHAT are, relatively, the most important aspects of the complex present-day nurse- 
patient relationship, and HOW should these aspects be balanced for optimum patient 
care in various settings?’”’ The implications of this question are discussed. 

Two sets of data are presented to illustrate the kinds of findings being made in 
the course of the research. The first set of data deals with a comparison of registered 
nurses’ and patients’ perceptions of various behaviors involved with the nurse’s role 
in relation to medication, in the tuberculosis setting. The second set of data consists 
of a correlational matrix indicating the degree of similarity of perceptions existing 
between several groups of individuals in the tuberculosis, psychiatric and general | 
hospital settings. 
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EFFECT OF FORM-COLOR INCONGRUITY 
ON RESPONSES TO INKBLOTS* 


MILTON WOLPIN AND ROY M. HAMLIN 
Western Psychiatric Institute and University of Pittsburgh School of Medicine 


Research using Rorschach or other inkblot materials has in general not sup- 
ported, nor clarified, the hypothesis that “color shock” phenomena depend on an 
association between color and affect as suggested by Rorschach “?, Schachtel® and 
others. Siipola®® and Bruner and Postman? have offered an alternative hy- 
pothesis, essentially that it is the incongruity between what is suggested by the form 
of a blot and what is suggested by the color which leads to the behaviors which are 
labelled emotional. A subject will encounter certain difficulties when asked to 
identify ‘‘a cat colored like a frog”, or a playing card with color and suit reversed, 
such as the three of hearts colored black“. 

When asked to identify, or respond in some other way to, a stimulus with such 
form-color incongruity, the behavior of subjects may be characterized by longer 
reaction times, more variability in content of responses, and more verbalized aware- 
ness of color. Furthermore, some subjects may show ‘“‘disruption”’ in their behavior: 
“‘a gross failure of the subject to organize the perceptual field at a level of efficiency 
usually associated with a given viewing condition” ®?, 

The effects of form-color incongruity have in general been supported by experi- 
mental studies where the stimuli were selected on an a priori basis to maximize in- 
congruity, and where the experimental! procedure imposed systematic limits on the 
responses which a subject could give. As examples, the studies of Bruner and Post- 
man), Berlyne“, and Hamlin, Stone, and Moskowitz may be cited as clearly 
supporting the incongruity hypothesis. Each of these studies employed widely 
different procedures and stimuli, and different responses; yet all agree in confirming 
the effect of perceptual incongruity in increasing response time. In contrast to such 
studies, other experiments have attempted to confirm the incongruity hypothesis 
with various “‘unstructured”’ stimuli, and with responses consisting of “inventive 
associations’”’ to a blot or the like. These studies have yielded conflicting and in- 
conclusive results. The studies of Siipola®®, Lazarus and Oldfield“, and of Berg 
and Polyot®) may be cited as examples. These studies justify the conclusion that 
the effect of incongruity is at least very difficult to demonstrate with unstructured 
materials and free responses resembling those employed in the Rorschach. 


HyYPoTHESES 


The present paper reports experimental results pertinent to the problem of 
demonstrating the effect of form-color incongruity with unstructured inkblots and 
with Rorschach-like responses. The following hypotheses were tested: (a) Incon- 
gruity will increase reaction times. (b) Incongruity will decrease number of popular 
responses. (c) Incongruity will increase use of color as a determinant. 


PROCEDURE 


An analysis of variance design was used with 204 female undergraduate students 
at the University of Pittsburgh. Each subject was given an Ishihara color-blindness 
test and was asked if she had ever taken a Rorschach test. No subject was included, 
in computing results, who was color blind or who had at some time taken a Rorschach. 

On the basis of a pilot study, six discrete details from the Rorschach blots were 
selected as most adequate for the experiment. Each of these details is one which 


*This paper is based on the dissertation of the first author, done under the direction of the second 
author, and submitted to the Department of Psychology of the University of Pittsburgh. Dr. John T. 
Cowles, Dr. A. D. Lazovik, Dr. J. T. Morrow, and Dr. George J. Wischner contetioated to the plan and 
execution of the study. 
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Beck ® lists as eliciting a popular response. In addition, the pilot study identified 
these six areas as those with the greatest agreement as to content when presented 
with a congruous color. Each stimulus detail was prepared in a congruous color and 
in an incongruous color. Table 1 lists the blot details and colors used. 


Taste 1. RorscuacH AREAS AND Cotors UsEp 





Area Rorschach Blot Congruous Color Incongruous color 





Animal Vill brown green 
rabbit xX gray orange 
bear skin VI brown violet 
spider x black urple 
bear II brown lue 
dog x brown pink 





Each subject was shown six blots,-and gave one response to each blot following 
instructions similar to those in ordinary Rorschach administration. The exact in- 
structions used“) followed as closely as possible those employed by Siipola“®. 

Timing was controlled by the subject, in order to eliminate experimenter bias, 
in the following manner. The subject was seated in a semi-dark room facing a box 
which contained a one-way vision mirror, with two light bulbs, one on each side of 
the mirror. At a signal from the experimenter, the subject pressed down on a switch, 
automatically reversing the lights, exposing the stimulus card, and starting an 
electric timer. As soon as the subject started to verbalize a response, she released the 
switch so that the stimulus was no longer visible, and the timer stopped automatic- 
ally. Subjects were trained in this procedure before beginning the experiment. A 
detailed description of the apparatus is given elsewhere“). 


The design of the experiment divided the 204 subjects into four groups of 51 


subjects each. Presentation of stimuli for each subgroup was as follows: 
Condition I: three congruous blots, followed by three congruous blots. 
Condition II: three congrous blots, followed by three incongruous blots. 
Condition III: three incongruous blots, followed by three incongruous blots. 
Condition IV: same as Group II, with additional instructions to respond chiefly 
on the basis of form. 


For each group, the first three blots were always the animal, bear skin, rabbit, 
in that order. Each group of 51 subjects was further divided into three sub-groups of 
17 subjects each, so as to provide all possible orders of presentation for the last three 
blots: bear, spider, dog. Table 1 indicates the Rorschach blot from which each detail 
was taken. Results were analyzed in terms of responses to the last three blots. 
Measures used were: reaction time, number of popular (P) responses, and number of 
responses scored for color. 


RESULTS 


Reaction time. The major hypothesis in regard to reaction time predicted that the 
four experimental conditions would result in different reaction times; more specifical- 
ly, color incongruity (Conditions II, III, and IV) would result in longer reaction 
times than would occur with congruous stimuli (Condition (I). The results fail to 
confirm this hypothesis, and do not even show a trend in the predicted direction. 
The mean reaction time in seconds for inkblot stimuli four, five, and six combined 
were: Condition I, 7.28; Condition II, 6.26; Condition III, 8.14; and Condition IV, 
7.38. The mean reaction time for the three incongrous conditions combined was 7.26 
seconds, essentially identical with the mean reaction time of 7.28 seconds for the one 
congruous condition. 

The analysis of variance based on reaction times, of course, showed no sig- 
nificant difference due to conditions. An F-ratio, significant at the .01 level, did indi- 
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cate a difference associated with the specific stimuli: that is, reaction time varied 
with the specific blots presented. This result is not surprising, and has no bearing 
on the incongruity hypothesis. Finally, the different treatments of the first three 
blots had no significant effect on reaction times to the last three blots. 


Content variability. The major hypothesis in regard to content predicted that form- 
color congruity (Condition I) would be associated with an higher number of popular, 
or P, responses than would form-color incongruity (Conditions II, III, and IV). The 
results clearly confirm this hypothesis. The number of P responses (for stimuli four, 
five, and six combined) was roughly twice as many for the congruous condition as 
for any one of the three incongruous conditions. 

Of 153 responses for each condition, the number of P responses was: Condition 
I, 84; Condition II, 43; Condition III, 36; and Condition IV, 41. Chi-square tests 
indicated a difference, significant at the .01 level, for the congruous condition as 
compared with each one of the incongruous conditions separately. Differences be- 
tween the incongruous conditions, compared with each other, were not significant. 
Each of the incongruous conditions produced only about half as many P responses 
as the congruous condition. 


Use of color. The basic hypothesis in regard to responses scored for color as the major 
determinant (CF or C) predicted that such color responses would be more numerous 
with incongruity than with congruity. The results clearly confirm this hypothesis. 
For each incongruous condition, the number of color responses was some four times 
or more the number for the congruous condition. 

Of 153 responses for each condition, the number of CF and C responses was: 
Condition I, 3; Condition II, 26; Condition III, 11; and Condition IV, 29. Chi-square 
tests indicated a difference, significant at the .01 or .05 level, for the congruous condi- 
tion as compared with each one of the incongruous conditions separately. The con- 
gruous condition (I) produced a much smaller number of color responses than any 
one of the incongruous conditions. 

In these results on color responses, we find the only significant result of the study 
indicating a difference among the three incongruous conditions when compared with 
each other, rather than with the congruous condition. One of the incongruous condi- 
tions (III) produced fewer color responses than the other two incongruous conditions 
(II, IV). Chi-square tests indicated that these differences are significant at the .01 
level. Apparently, a subject may give fewer CF and C responses to incongruous 
blots if he has just previously had experience with three other incongruous blots (cf. 
procedure in regard to stimuli presentation under Condition III). 


Other results. The results so far refer to responses on stimuli four, five, and six 
(Table 1). In general, differences in the three preceding stimuli had little effect on 
these results. With one exception, the differences occurred between the congruous 
condition (I) and each one of the incongruous conditions (II, III, IV). The one 
exception was that Condition III produced fewer CF and C responses then either 
Condition II or 1V. The important variable in this experiment, therefore, seems to be 
the congruity or incongruity of the blots, little affected by immediately preceding 
experience with incongruity ‘%5. For this reason, a further check on two of the 
three main hypotheses could be carried out by analysis of the responses to stimuli 
one, two, and three. 

The hypothesis that incongruity would result in longer reaction times than 
would occur with congruous stimuli was again not confirmed. An analysis of var- 
iance, based on responses to the first three stimuli, showed no significant difference in 
reaction times for conditions. The mean reaction time for the one incongruous condi- 
tion (IIT) was 3.45 seconds, as compared with a mean reaction time of 3.83 seconds 
for the three congruous conditions (I, II, IV). As before, an F-ratio significant at the 
.01 level did indicate a difference associated with the specific blots presented. But 
incongruity had no effect on reaction time. 
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The hypothesis that incongruity would result in fewer P responses was again 
confirmed. The one incongruous condition (III) produced only 49 P responses, 
whereas the mean number of P responses for the three congruous conditions (I, II, 
IV) was 82. A chi-square test indicated that this difference was significant at the .01 
level. The congruous condition produced many more P responses than did any one 
of the incongruous conditions. 

The third hypothesis, that incongruity would result in more CF and C res- 
ponses, could not be tested on the responses to the first three stimuli. The 204 sub- 
jects gave only four CF and C responses to these stimuli, too few to permit analysis. 


Discussion 


These results emphasize three considerations pertinent to experimental studies 
of Rorschach ‘color shock’. First, the form-color incongruity hypothesis can be 
demonstrated in artificialized and carefully controlled experiments“: * *), However, 
with ‘‘free responses” and unstructured stimuli (similar to standard Rorschach condi- 
tions), incongruity affects responses in a variable manner, producing results which 
are not consistent and clear-cut“: 7:19. The present study illustrates this point. 

Second, even though we can confirm the incongruity hypothesis, we should not 
accept it as the complete, or major, explanation of the phenomenon of “‘color shock”’ 
as described by Rorschach. The results of the present study confirm the effect of 
incongruity in increasing content variability (fewer P responses), and in increasing 
verbalized attention to color (CF and C responses). If the subject tends to “search” 
for more varied content, and if he gives more attention to the problem of integrating 
color and form, we might expect his reaction times to increase. But, in this study, 
they Ay not. In the relatively unstructured situation, too many variables are in- 
volved. 

Third, Rorschach stressed ‘‘a lack of associations which appears when the colored 
plates are presented’’®: ». '®); and he regarded such “color shock” as a “sign of 
neurotic suppression of affect’’“: »- %) in certain subjects whom he characterized 
in a variety of descriptive phrases. Objective studies have neither confirmed nor 
adequately tested his tentative observations. The factor of incongruity may well 
play an incidental role in triggering the ‘‘drying up of associations’’ which Rorschach 
described in certain subjects only. However, the effect of incongruity may consist 
chiefly in establishing a set: a situation in which the subject feels called upon to pro- 
duce personally meaningful associations incorporating colors which are distasteful ® 
or disturbing to him. Only specified neurotic subjects, or experimentally prepared 
subjects, might then be expected to show a marked “‘lack of associations’’. 


SuMMARY 
The present study attempted to demonstrate the effect of form-color incon- 
gruity with unstructured inkblots and with Rorschach-like responses. Three hy- 
potheses were tested, with the following results: 

1. Incongruity did not increase reaction times. The results do not even suggest 
a trend in the predicted direction. 

2. Incongruity did decrease the number of popular responses. Subjects gave 
approximately twice as many popular responses to stimuli which were form- 
color congruous as they did to stimuli which were incongruous. The differ- 
ence was significant at the .01 level. 

3. Incongruity did increase the use of color as a determinant. With those 
incongruous stimuli which permitted analysis, subjects gave many more CF 
and C responses than they did to congruous stimuli. The difference was 
significant at the .01 level. 

1Responses to the first three stimuli differed from responses to the last three stimuli in two res- 
pects: reaction times were faster for the first three stimuli; and the first three stimuli elicited almost no 


CF and C responses. These differences could well be due to the specific blots involved. However, the 
data do not justify any one explanation. 
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In evaluating the results of this and other studies, the conclusion is reached that 
the incongruity hypothesis has, in general, been supported by studies employing 
carefully artificialized and controlled experimental procedures. However, the effect 
of incongruity is at least very difficult to demonstrate with unstructured materials 
and free responses resembling those employed in the Rorschach. Therefore, the 
hypothesis should not be regarded as an adequate explanation of the phenomenon of 
“color shock” as described by Rorschach 
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NON-VERBAL COMMUNICATION OF AFFECT! 


ROBERT E. MILLER, JOHN V. MURPHY? AND I. ARTHUR MIRSKY 
School of Medicine, University of Pittsburgh 


PuRPOSsE 


The phenomenon of emotional communication between individuals plays an 
important role in several psychodynamic theories ®: *, This affective interplay may 
be observed at the symbolic level in terms of verbal content or at the motor level 
through gross physical behaviors, facial expressions, and posture or vocal manifesta- 
tions. Further, the emotions are expressed physiologically in terms of autonomic 
change. Needless to say, each of these routes of emotional expression varies accord- 
ing to the intensity of the affect and, oftentimes, are concurrently manifest, particu- 
larly in cases of very intense affective experiences. The method to be described in 
this report and illustrated by three experiments, focuses on a motor response as 
evidence that an affective ‘‘message’’ has been communicated and received. 

The rationale for focusing on a motor or behavioral response rather than a 
verbal report or description is as follows: While verbal report has been almost ex- 
clusively emphasized in clinical and experimental studies of emotional expression in 
humans, non-verbal elements have been repeatedly stressed by theorists®). Further, 
clinical observation strongly suggests that affective experiences may be subject to 
distortions and omissions in translation to verbal description. However, the study 


1This research was supported in part by a research grant (M-487) from the National Institute of 
Mental Health of the National Institutes of Health, U. 8. Public Health Service, and in part by a 
grant from the Foundations’ Fund for Research in Psychiatry. 
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of communication through motor responses has proven fruitful. The ethologists 
have studied in elegant detail the communicative acts of animals with marked 
success“: # 1), In contrast, research using verbal reports from human subjects has 
proven discouragingly difficult. Using verbal report in his classic experiments, 
Landis“: ?) demonstrated that facial expressions obtained during traumatic exper- 
iences were not reliably categorized by untrained judges. Only after years of contro- 
versy and contradiction is it generally accepted today that subjects can give reliable 
verbal judgments of affects depicted in posed photographs provided that broad 
categories of judgment are employed “: Chapt. 22), 

The method described in the present paper is designed to investigate non- 
verbal communication of affect using a motor response as evidence that communica- 
tion has occurred. Initial studies using rhesus monkeys as subjects have yielded en- 
couraging results, suggesting that application of the method to human subjects 
might materially advance our understanding of non-verbal communication of affect. 


METHOD 

The method used to study the communication of affect requires that the sub- 
jects be conditioned to perform an avoidance response. In the three studies to be 
described here, an interanimal avoidance response”) was conditioned. In inter- 
animal conditioning one animal serves as the conditioned stimulus for avoidance 
behavior on the part of the second animal. The conditioning is accomplished simply 
by exposing the stimulus animal to the view of the conditioning subject for five 
seconds prior to the onset of a strong electric shock to the feet of the conditioning 
subject. The only means of escape for the conditioning subject from the painful 
shock is by pressing a lever in the conditioning apparatus which terminates both 
shock and sight of the stimulus monkey. As the conditioning monkey learns the bar- 
pressing response, it begins to avoid the painful US by performing the instrumental 
response upon each exposure of the stimulus animal before five seconds have elapsed, 
thus terminating sight of the stimulus monkey and avoiding shock. Data concerning 
the conditioning and extinction of the interanimal avoidance response“ and its 
effects upon group social interactions“: ® have been presented elsewhere. It should 
be emphasized that the stimulus monkey in interanimal conditioning cannot see the 
conditioning subject at any time and, further, at no time receives shock or other 
painful stimulation during interanimal conditioning. 

The communication of affect tests are conducted during or following experi- 
mental extinction of the interanimal avoidance response. The stimuli which are pre- 
sented are of two kinds: (a) Exposures of the conditioned stimulus for the avoidance 
response, t.e., calm, non-fearful monkeys, and (b) exposures of monkeys responding 
to a pain /fear eliciting stimulus. The theory of stimulus generalization would predict 
that any difference in incidence of avoidance behavior between these two classes of 
stimuli would be in the direction of greater responsiveness to the CS. A significantly 
greater response to the exposure of fearful monkeys would suggest that such stimuli 
elicit fear in the subject and thereby trigger the fear-motivated avoidance response. 

Three experiments which illustrate the method will be briefly presented. Ex- 
periment I demonstrates that sight of a monkey behaving fearfully will reinstitute 
the avoidance response after extinction of that response to zero level. Experiment 
II demonstrates communication of affect to full-size color pictures of fearful animals. 
Finally, Experiment III demonstrates the phenomenon with pictures of facial ex- 
pressions only. The first of the three experiments has been considered in a previous 
paper 6). 


RESULTS 


Experiment I. In the initial communication of affect experiment“) the inter- 
animal conditioned avoidance response was first established and then extinguished 
to zero level’). Following a period of rest in the home cage (median 23 days) the 
seven experimental animals were reintroduced into the test apparatus and the 
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spontaneous recovery of the conditioned avoidance was again extinguished to zero 
level. The next day the communication of affect test was begun. The animals were 
again placed in the conditioning apparatus. On these trials, however, the shock 
unconditioned stimulus was delivered not to the test subject but to the stimulus 
animal. Thus, the test monkey was presented with exposure of the stimulus monkey 
responding affectively to a painful situation. The stimulus animal could not escape 
or avoid the shock. A bar-pressing response on the part of the test subject termin- 
ated the shock in the stimulus compartment as well as removing the stimulus mon- 
key from view. 

The results indicated that sight of a monkey responding affectively to a painful 
stimulus significantly (p <.05) reactivated the extinguished avoidance response. 
The test monkeys pressed the bar upon exposure of the fearful stimulus monkey al- 
though they did not receive any punishment themselves. Further, the fearful be- 
haviors of the stimulus monkey in anticipation of shock were as effective in eliciting 
avoidance responses in the observer as behaviors of the stimulus monkey while 
actually receiving shock. 

Since avoidance learning is, according to Mowrer “?, motivated by fear and re- 
inforced by fear-reduction, the occurrence of a conditioned avoidance response pre- 
sumes the presence of fear and /or anxiety in the conditioning subject upon presenta- 
tion of the stimulus. Similarly, extinction of a conditioned avoidance response sug- 
gests that fear of the conditioned stimulus has diminished. From this theoretical 
framework one would interpret the findings of the communication of affect experi- 
ment to mean (a) that rhesus monkeys can clearly discriminate affective behaviors 
expressive of fear and/or pain in other monkeys, and (b) that recognition of such 
behaviors in other animals evokes fear in the viewer. 


Experiment II. In Experiment I a multiplicity of expressive cues were present 
from the stimulus monkey. The animal responded vigorously to shock in several 


stimulus dimensions, 7.e., movements, vocalization, facial expressions, etc. Experi- 
ment II was a first step in attempting to identify and to specify the relevance of 
these various expressive behaviors. Thus, Experiment II investigated the communi- 
cation of affect to 35 mm. color transparencies of fearful vs. non-fearful monkeys. 
These photographs were whole body slides showing facial expression and posture and 
were projected life-sized in the conditioning apparatus. As in the initial experiment, 
the test subjects (N = 8) had been conditioned to avoid shock by pressing a bar upon 
exposure of the conditioned stimulus, in this case slides of a non-fearful monkey. 
The conditioned avoidance response was extinguished to the 40 per cent level before 
the communication of affect tests were begun. Four categories of stimuli were pre- 
sented in the communication of affect tests: (a) stimulus animal - not fearful, (b) 
stimulus animal - fearful, (c) other animals - not fearful, (d) other animals - fearful. 
An extinction procedure was used throughout the communication of affect tests, 7.e., 
no shock was presented for failure to respond. The slides were presented in a Latin 
Square sequence which was repeated until less then 10 per cent avoidance occurred 
to each of the categories. Two additional determinations of the communication of 
affect phenomenon were made. One was a spontaneous recovery measure while the 
other was made immediately after reconditioning with the avoidance response at the 
90 per cent level. 

It was found that there were significant differences (p < .02) in the number of 
avoidance responses to extinction between the four stimulus categories. The picture 
of the stimulus animal in fear and/or pain poses elicited more avoidance responses 
than the pictures of the same animal in non-fearful poses. This finding is quite re- 
markable in view of the fact that ‘“‘stimulus monkey - not fearful” was originally the 
CS for avoidance learning and had actually been paired with painful shocks. In 
similar fashion, pictures of “other monkeys - fearful” elicited more responses than 
did ‘‘other monkeys - not fearful”. The results of this experiment indicated that 
posture and facial expression are sufficient cues for the communication of fear and /or 
anxiety between rhesus monkeys. 
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Experiment III. The third experiment has only recently been completed. 
Briefly the same procedures were followed, but in this study facial expressions only 
were the stimuli. When slides of facial expression in calm and fearful monkeys were 
presented in the communication of affect method, seven test animals performed sig- 
nificantly (p < .02) more avoidance responses to fearful than to non-fearful facial 
expressions. 


SUMMARY 


These experiments have demonstrated that the affect of fear and /or anxiety can 
be perceived and discriminated by rhesus monkeys in the facial expression and post- 
ure of other monkeys. The enhancement of avoidance behavior by presentation of 
pictures of monkeys reacting fearfully indicates that exposure of a fearful stimulus 
animal elicits, through some empathic process, fear in the viewer. Pilot experi- 
ments ®) have suggested that the monkey will not respond fearfully to exposure of 
other species of animals responding affectively to painful stimuli. 

The present studies are only first steps in the identification of non-verbal com- 
municative cues. It is possible, indeed probable, that other expressive modalities 
such as movement and vocalizations are also effective in the communication of var- 
ious affects between primates. 

These experiments, in addition to providing data concerning the communica- 
tion of affects in sub-human primates, suggest that similar methods might be utilized 
to investigate the empathic response to affect in man. As the study of facial ex- 
pression of emotions has clearly indicated“: ?: ), verbal descriptions of the affective 
qualities of faces present categorization and interpretative problems. Indeed, the 
very act of verbalizing may serve to vitiate or conceal the affective manifestations 
involved. It is possible that the use of conditioned and /or autonomic responses of 
subjects upon exposure of emotionally expressive faces would provide less ambiguous 
measures of the communication of affects. It may well be that physiological indi- 
cators of affect could be of key importance in the investigation of affective communi- 
cation in man. Information regarding this very important phenomenon would con- 
tribute to psychodynamic theory and to current clinical procedures and observations. 
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THE HUNTER-PASCAL CONCEPT FORMATION TEST: 
AN EXPERIMENTAL APPROACH TO THE MEASUREMENT OF 
CORTICAL CAPACITY? 


G. R. PASCAL AND W. O. JENKINS 
University of Tennessee 


This paper presents experimental results with a test for human organisms which 
has its origin in the work of the animal laboratory. Hunter studied the symbolic 
behavior of the infra-human organism and developed the delayed reaction and the 
double alternation problems “: 7 * % 1®, Later work with these two problems is 
summarized by Stone? and Munn“®, Performance on both problems has been 
shown to be related to position on the phylogenetic scale, using animals from fish to 
humans, and ontogenetically, to maturational level in human subjects. “: * 5. § 1 1, 
14, 20, 21, 22, 24) 

Using these two problems as a basis, a test has been developed which is useful 
with human subjects from infancy to old age, free of the visuo-motor requirements 
of most tests of concept formation, and relatively free of cultural influences. The 
biological bases of the problems used in the test suggest that performance on the 
test is measuring some aspect of human capacity not previously tested systematically. 
Experimental evidence is available which indicates a relationship between perform- 
ance on the basic problems and cortical capacity “: !: 17) 18, 28), 


DESCRIPTION OF THE TEST 


The apparatus used for all the problems consists of a flat vertical wall with 
five doors equidistant from each other of exactly the same outward appearance, but 
distinct from the rest of the wall and located at the bottom of it. In the outside 
dimensions, the apparatus is 18” long by 12” high by 10’ deep. It is made of 3/8” 
plywood. The doors are 244” by 314” and hinged at the top so that they swing in- 
ward to the E’s side of the apparatus. Plywood dividers separate each door on E’s 
side. The apparatus is a modification of one previously described °*. The doors are 
numbered on the examiner’s side of the apparatus, one to five beginning on the ex- 


Fig. 1. Apparatus For H-P Trsr 
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1We are very grateful to the following students for their contribution to the development of this 
test: D. Bidus, P. Caro, M. Dinoff, M. Friedman, 8. Gelburd, H. Salzberg, J. Openshaw, B. Sargent, 
C. Walter, 8. Wechkin, G. Yanagi. Special thanks are due to Dr. Clifford H. Swensen and Herman 


Salzberg for constructive criticisms of the manuscript. 
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aminer’s left (Fig. 1). In all the problems 8’s task is to find a reward placed behind 
one of the doors. There are, in all, nine problems as follows: 

Problem I - Immediate Reaction. In this task S sees the placement of the reward behind a 
pre-determined door and is immediately allowed to search for it. A total of ten trials is allowed 
with each trial consisting of one presentation. 

Problem II - Delayed Reaction 1 (short). Here, 8 sees placement of the reward behind a pre- 
determined door, is turned around through 360 degrees and is then allowed to search for the re- 
ward. Ten trials are given as with Problem I. 

Problem III - Delayed Reaction 2 (long). 8S sees the reward placed behind a pre-determined 
door, as in Problem II, then is distracted for a period of one minute and then allowed to search for 
the reward. Again, ten trials are given. 

Problem IV - Delayed Reaction 3 (Single Alternation Cue). In this task S is shown a dis- 
tinctive stimulus such as a poker chip in front of the middle door. He is then turned with his back 
to the apparatus. During this delay interval, the reward is pone behind one of the end doors 
(door 1 or 5) unbeknown to § and the poker chip is placed in front of this same door. 8 is then 
allowed to search for the reward. Subsequently, the reward is placed, in a similar manner, behind 
the other end door. This alternation sequence continues until S solves or fails to solve the prob- 
lem (see below), allowing a maximum of ten trials. This problem is a transitional one between 
delayed reaction and the single alternation to follow. 

Problem V - Single Alternation. Here, the reward is placed behind end doors alternately 
without 8’s being aware of where the reward is put. A total of ten trials is allowed. Each trial 
consists of two presentations, one to each end door. 

Problem VI - Double Alternation. S’s task, here, is to solve the double alternation problem 
learning the ——— that the reward is hidden twice behind one end door and then twice behind 
the other end door. A total of ten trials is allowed. Each trial consists of four presentations, two 
to each end door. 

Problem VII - Serial Reaction I. This problem consists of a combination of double and triple 
alternation. Each trial consists of five presentations. 8’s task is to learn the correct sequence of 
reward placement. A total of ten trials is allowed. 

Problem VIII - Serial Reaction 2. This task involves a combination of single and triple 
alternation. Each trial consists of six presentations. S must learn the correct sequence of reward 
placement. A total of ten trials is given. 

Problem IX - Serial Reaction 3. This problem is a combination of single and double alterna- 
tion consisting of eight presentations per trial. S’s task is to learn the correct sequence of reward 
placement. Ten trials are allowed. 


The criterion of success on each problem is two consecutive correct trials. Re- 
wards are used for all Ss and vary with the age and cultural background of S. It has 
been found that corn candy or some such objects work well for young children. 
College students and average adults work for poker chips. Psychotics will perform 
well when rewarded more materially by such things as money or cigarettes. 

There are several available indices for scoring the H-P Test, e.g., pass-fail by 
problems (the traditional method), total number of errors, and total number of 
trials. However, we have found that a sensitive measure of overall performance on 
the test is the Presentation Score. Pass-fail correlates almost perfectly with pre- 
sentation score“), Error scores correlate .75 with presentation score“), The Pre- 
sentation Score is obtained by totalling the number of presentations required across 
all the problems. If S succeeds on a problem, his Presentation Score consists of the 
number of presentations prior to the criterion for that problem. If § fails, he is given 
the number of presentations possible for the problem, e.g., if he fails Double Alter- 
nation, he receives a Presentation Score of 40. The highest possible Presentation 
Score is 290 which indicates failure on all problems. Thus far such a score has been 
obtained only by children below the age of 10 months. 


RESULTS 
Approximately 500 cases have been tested ranging in age from four months to 
55 years. However, not all of these were tested on exactly the same problems, 
although the problems were of the same general nature. In reporting distribution 
data, therefore, the total number of cases tested cannot be used. The distribution 
data shown in Table 1 (Fig. 2) are based on 175 cases using the latest revision of the 
test. Although the function seems to be linear within the age range tested, other 
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TasLe 1. Distrrvution Sratistics oN H-P Test PRESENTATION SCORE 





Age Mean Median Range 





0-2 278 25 
2-4 260 20 
4-6 254 10 
6-10 229 108 
10-14 183 207 
14-18 139 198 
18-22 102 101 162 





evidence suggests that when older age groups are added it will eventually turn out to 
be parabolic. Based on the individual data summarized in Table 1 the correlation 
between age and Presentation Score was .83. 

It will be noted in Table 1 that as age increases there is a tendency for an in- 
crease in variability. Table 2 may offer a partial explanation for this finding since 


TABLE 2. CoRRELATIONS BETWEEN PRESENTATION SCORE AND CHRONOLOGICAL 
AND MENTAL AGE 





Age 
Investigator Range N C.A. M.A. 





Collins-Tapp % 2) 5-60 mos. 46 86 
Kaiman (2) 4-66 mos. 70 92 .54 (N = 30) 
Larsen ‘!*) 7 yrs., 3 mos. to 

18 yrs., 11 mos. 60 .58 76 

18-22 yrs. 45 .07 





it seems that for the first 66 months of life most of the test variance is accounted for 
by chronological age. Notice that from 66 months of age to 18 years, 11 months, the 
correlation coefficient between CA and H-P score drops from .92 to .58. Correspond- 
ingly, the correlation with mental age increases from .54 to .76. We have obtained a 
correlation of —.01 between Otis IQ and Presentation Score in a sample of 45 female 
college sophomores. It seems that by adulthood some variable other than IQ is 
affecting performance on the H-P Test. We do not mean that H-P Test performance 
is not correlated with IQ when the whole range of IQs (including mentally deficient 
Ss) is included. However, within the range of so-called ‘“‘normal’’ IQs, our data indi- 
cate little relationship between IQ and H-P Test performance. 


Fia. 2. RELATIONSHIP BETWEEN C.A. and H-P PresenTaTIoN Score. 
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One variable which may be influencing H-P Test performance is suggested by 
Leslie’s findings“. Leslie ran two samples of 25 first admissions to a mental hospital 
and compared H-P performance with the final diagnosis as made by a team of psy- 
chiatrists. Using the grand mean as a point of division, he found that 82 percent of 
psychotics and only 21 percent of neurotics obtained high Presentation Scores. For 
the second sample, 90 per cent of psychotics and only 33 per cent of neurotics ob- 
tained high Presentation Scores. For all 50 cases a biserial r of .80 was obtained. 
Worley? did a similar study at a different mental hospital and obtained non- 
overlapping distributions for the performance of psychotics and neurotics. Jensen “!? 
obtained comparable results. In a different setting, Davis®) contrasted the per- 
formance of 14 skid-row alcoholics with that of 14 controls matched for age, sex, IQ, 
and socio-economic status. The scores of alcoholics exceeded that of their matched 
control partners in every case except one in which there was a tie and another in 
which there was slight reversal. All of these studies showed an insignificant correla- 
tion between IQ and H-P performance. 

With respect to the test, itself, we have some data on the test-retest reliability. 
Kaiman“*) retested 15 children of his target population after approximately two 
months and obtained a reliability coefficient of .84. Another possible estimate of the 
test reliability may be obtained by contrasting performance on the first half of the 
problem failed (and the last problem administered) with that of the second half. 
Kaiman computed this correlation for his 70 subjects and obtained a coefficient of 
.86. For Ss from the ages of 7 years, 3 months, to 22 years, the time required to 
administer the test averages 21 minutes, with a range of 17 minutes“*?. 

There has been much discussion of the maturational difference between the 
sexes. Kaiman“?) obtained a coefficient of .02 when contrasting test performance 
based on sex differences. Larsen“) found similar results. One of our students ad- 
ministered the test to blind-folded Ss and found no significant difference between 
their scores and those of their controls. Another administered the test to Chinese and 
Korean students in the U. 8. and found no significant difference between their scores 
and those of matched American students. Performance on the test does not correlate 
with course grades in a college population. When two separete samples were tested, 
a class of 45 students in psychology and a class of 40 students in mathematics, no 
significant relationship with course grades was obtained. 


Discussion 


The H-P Test consists of two fundamental problems, the delayed reaction and 
the alternation problem. Performance on these problems has been found to be re- 
lated to phylogenetic development and maturation in human Ss. It would appear 
that some fundamental capacity of the organism is being assessed by the test. At 
the moment, the precise nature of this capacity is an experimental question. What 
ever it is, it does not seem to be tapped by the usual tests of intelligence, at least in 
the adult population. Some of the common psychological constructs that could be 
applied to the ability to perform on the test are freedom from distractibility, learn- 
ing, retention, and flexibility. 

The first of these, freedom from distractibility, leads us to speculate about a 
possible relationship between performance on the H-P Test and the findings that 
have emerged from extirpation studies with monkeys using the delayed reaction and 
similar tests. In Harlow’s words, ‘“‘Delayed-response type tests—which may measure 
memory but which we have reason to believe basically measure some attentive func- 
tion—are primarily impaired by lesions in the frontal association areas” “: P. %), It 
seems possible to hypothesize, very tentatively indeed, that performance on the 
H-P Test may be related to the efficiency of frontal association areas. A similar 
hypothesis has been made previously “?. 

It is intriguing to us that problems developed for use with infra-human Ss not 
only lend themselves for use with human 8s of all ages, but also yield orderly data. 
We hope that this test will be the forerunner of measures of human capacity based on 
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the findings of the experimental laboratory oriented toward psychology as a biologi- 
cal science. It seems obvious that some capacity of the human organism is being 
measured by the H-P Test, but its nature and importance is a matter for future 
research. A preliminary manual containing instructions for administration and 
scoring is available upon request from the authors. 


SUMMARY 


This paper reports the development of a test of human capacity based on con- 
cept formation problems previously used with infra-human subjects. Data are pre- 
sented showing the relationship between test performance and CA, MA, IQ, sex, 
and the psychotic-neurotic dichotomy. Some statistical characteristics of the test 
are reported. The test appears to be estimating some aspect of cortical capacity not 
measured in the usual tests of intelligence. 
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PREDICTION OF FIRST GRADE SCHOOL ACHIEVEMENT WITH THE 
BENDER GESTALT TEST AND HUMAN FIGURE DRAWINGS 


ELIZABETH M. KOPPITZ, JOHN SULLIVAN, DAVID D. BLYTH AND JOEL SHELTON 
Children’s Mental Health Center, Columbus, Ohio 


PROBLEM 


Learning problems in young children are frequently not fully recognized until 
the second or third grade or even later. By that time the child, the parents and the 
teacher tend to be quite frustrated and the emotional problems developing out of the 
situation tend to intensify the difficulties. If school achievement could be predicted 
at the time of school entrance then special consideration could be given to youngsters 
with potential learning problems and many later frustrations avoided. 

Earlier studies“ *? have shown that school achievement in the first grade is 
closely related to visual-motor coordination and perception as measured on the 
Bender Gestalt Test“. However, school achievement depends also on a number of 
other variables such as intelligence, motivation for learning, emotional adjustment, 
home environment and socio-economic status. Good prediction of school success 
must take as many of these factors into account as possible®’. The Human Figure 
Drawing Test has been found to be quite sensitive to many of these factors and 
could therefore also be a good indicator of school achievement. This study tested 
the hypothesis that the Bender Gestalt Test and Human Figure Drawings together 
can predict first grade school achievement. If this hypothesis is valid then the 
Bender and Drawings could be used as screening instruments at the beginning of 
the school year for the detection of potential learning problems. 


METHOD 


Subjects. The Ss were 143 first grade students from six different classrooms repre- 
senting a socio-economic cross-section of the Greater Columbus area. 


Class A was taken from a lower class urban neighborhood; 

Class B was taken from a striving middle class suburban community; 
Classes C and D were taken from an upper middle class urban community; 
Class E was taken from a middle class small town community; 

Class F was taken from a lower middle class rural area. 


Although all children present in each classroom were tested, only those on whom 
complete data were available at the end of the school year were ultimately included 
in this study. Table 1 shows the number of Ss in each classroom. The mean age for 
all Ss was six years and three months.* 


Tests. The three tests used in this study included (a) the Bender Gestalt Test; (b) 
Human Figure Drawing Test; (c) the Metropolitan Achievement Test, Primary I 
Battery, Form R. ; 


(a) The Bender was administered individually to each S by one of the investigators. The 
test protocols were scored according to the Koppitz scoring system® 4) which was developed for 
the purpose of diagnosing children with learning problems. This scoring system includes four 
major categories with 31 separate items. Since only deviations on the Bender are scored, a high 
score indicates a poor Bender, while a low score indicates a good Bender. 


(b) The Human Figure Drawing Test was administered by the teacher to each class as a 
group. The Ss were requested to draw ‘‘a whole person”. The drawings were scored according to 
a tentative scoring system developed on Classes A, B, C, and D of this study. This was done by 
comparing the drawings of the Ss whose actual achievement was either above or below the achieve- 
ment predicted on the basis of the Bender with the drawings of those Ss whose actual and predict- 


*The authors wish to thank Mr. Russel Slater, Mr. James Kirkpatrick, Mrs. Jenny Raduege, and 
the principals and teachers for their cooperation in making this study possible. 
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ed achievement was the same. All those items on the drawings which differentiated the two groups 
of Ss were combined into a tentative scoring system. The interpretation of these items was b 

on clinical observation and on the findings of others rE Machover, Goodenough and 
Buck. The scoring system was cross-validated on Classes E and F of this study. 

The ee system for Human Figure Drawings consists of 3 scoring categories. The first 
one is associated with poor or reduced achievement due to emotional upset and/or lack of mental 
ability. It includes 12 scoring items which are scored negatively: 

Three or more figures drawn spontaneously 
Grotesque, non-human figure 
No body 
No arms 
No eyes 
No pupils, vacant eyes 
No mouth 
Tiny drawing (2” or less) 
Figure slanting by 30° or more 
Smudges, unfinished drawing 
Arms rigidly clinging to body 
Drawing grossly organic 
The second category includes six items which seem to reflect pressure for achievement and/or 
ng striving and which appear to be associated with good or improved school achievement. 
ese items are scored positively: 
a. Head much larger than body 
b. Ground, grass, flowers or tree spontaneously added 
c. Sky, sun or clouds spontaneously added 
d. Teeth 
e. Excessively long neck 
f. Glance of eyes, cross-eyed 
The third scoring category includes three items which are usually found in drawings of older 
children. When they do occur in the drawings of first graders they indicate above average intelli- 
gence. These items are associated with good achievement and are scored positively: 


I. Movement of figure 
II. Head in profile 
III. Joints in knee or elbow 


The positive and negative Drawing scores are added to a basic score of 10; thus a high score tends 
to be associated with good achievement and a low score with poor achievement. 


(c) The Metropolitan Achievement Test, Primary I Battery: Form R was administered ac- 
cording to standard procedure in groupe by the classroom teachers. The tests were scored by the 
investigators and the scores conve into grade level ratings according to the norms furnished 
by the publishers®), 


QP > 


Bo 


Pris to 


Procedure. During the fourth to sixth week of the school year the Bender was ad- 
ministered to all Ss individually. Within a-week of this administration Figure 
Drawings were obtained from each class as a group. Thereafter the Benders and 
Drawings were scored.t Seven months after the first two tests were given, the Metro- 
politan Achievement Test was administered to each class by its teacher. The test 
scores were then first correlated with the Bender scores alone and later with the 
Bender and Drawing scores together in a Multiple Correlation. 


RESULTS 


Table 1 shows the means and standard deviations of the various measures used 
in this study according to individual classes and for the whole group. The mean 
Bender scores show a marked difference between the classrooms of varying socio- 
economic status. Thus the mean Bender score of class A is about six months below 
the mean score for the total group. It resembles more the scores of kindergarten 


tSincere thanks are due to Mrs. Amy Furuta for her patient and conscientious scoring of the 
test protocols. 
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TaBLE 1. Mean Scores aND STaNDARD DeEvIATIONS oF TESTS AND CORRELATIONS 
BEetwEEn TEsts 





Classroom D F Total 





E 
27 


A 
N 23 
Bender Gestalt 


Mean Errors 
8.D. 


Figure Drawings 
Mean re 
8.D. 


Metro. Achiev. Test 
Mean Reading Achiev. 
Mean Number Achiev. 
Mean Average Achiev. ; : 
8.D. .49 .48 

Correlations! 

Bender & Mean Ach. -.44* -.41* 
Draw & Mean Ach. .23 .52** .44** 
Bender, Draw & Ach. F .62** .59** 





*Significant at the .05 level. 
**Significant at the .01 level or better. 


1The scoring of errors on the Bender results in negative correlations. 


children than of first graders. The mean Bender scores of Classes C and D, on the 
other hand, are about six months ahead of the group average. Their Benders are on 
the level of seven year old children. The mean Bender scores for Classes B, E and F 
were similar to the average scores for the whole group. 

On the Drawing Test, classes A and F showed mean scores below the group aver- 
age, while class B revealed the highest drawing score. The other three classes ap- 
proximated the average score for the group as a whole. However, the average score 
of 10 might represent an absence of both positive and negative scores, or it may 
represent a balance of positive and negative scores. Because of this, an analysis of 
the distribution of Drawing scores is given on Table 2. 


TaBLe 2. PERCENTAGE OF SUBJECTS WITH NEGATIVE AND PositIvE DRAWING 
Scores 








Negative Scores Positive Scores 
Emotional or low IQ High Motivation High IQ 


78 
45% 


28% 
50% 
56% 


45% 
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Table 2 indicates that classes A and F have a much higher percentage of Ss with 
negative scores than with positive emotional scores. Only very few of the Ss in these 
classes show above average endowment. Classes B and E reveal many bright child- 
ren and a remarkably high incidence of emotional pressure and motivation for 
achievement along with a rather large amount of negative emotional indicators. 

Class C seems to be the best balanced class in this study. It shows many bright 
Ss with a moderate incidence of both positive and negative emotional indicators. 

Class D is a most unusual class which deviates from the other five classes in all 
results of statistical computations. The unexpected results may be in part ex- 
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plained by the findings shown in Table 2. The Ss in this class seem to be unusually 
well endowed, 7 out of the 17 Ss showed superior intelligence. In addition they 
showed a very high incidence of emotional negative signs and unusual pressure for 
achievement. The overloading in all three of these areas made predictions of achieve- 
ment for this class difficult. 

The mean scores on the Metropolitan Achievement Test (Table 1) for the whole 
group was 2.1 or 2 months above the expected national norm at the end of the first 
grade. The mean scores for each class parallel the mean Bender scores; that is, class 
A’s score was about six months below the group average while classes C and D were 
about six months above the group mean score. The other classes approximated the 
average achievement score. Thus each class maintained its relative position within 
the total group throughout the first year of school. 

Table 1 gives the correlations between the Average Achievement scores and the 
Bender and Drawing scores as well as the multiple r of the Bender and Drawings to- 
gether with the Achievement scores. Both the Bender and Drawings were found to 
be significantly related to achievement for the group as a whole as well as for each 
class individually with the exception of class D. However, the multiple r for the two 
tests was consistently higher than the correlations for the separate tests. Thus it was 
shown that the Drawings contribute factors to the prediction of first grade achieve- 
ment which are not measured by the Bender. These findings strongly support the 
hypothesis that the Bender and Human Figure Drawings together are useful pre- 
dictors of school success and failure in the first. grade. 

The multiple r’s for the individual classes were obtained through computations 
employing the Beta Weights derived on Classes A, B, C and D. Direct application 
of these Beta Weights for individual prediction for each 8 cannot be achieved without 
correction for the differences between classes which exist and are not measured by 
the variables tested, such as te2zcher ability and attitude, verbal intelligence, etc. 

TaBLE 3. CORRELATIONS OF THE BENDER WITH THE METROPOLITAN 


ACHIEVEMENT Test, THE DrawinG Test, AND THE AGE FOR ALL 
SuBsEcts 





Bender! 





Metropolitan Reading Achievement -.47** 
Metropolitan Number Achievement -.54** 
Metropolitan Average Achievement -.55** 
Human Figure Drawing Test -.27** 
Age (excluding repeaters) -.10 





1The scoring of errors on the Bender results in negative correlations. 


*Significant at the .05 level. 
**Significant at the .01 level or better. 


Table 3 shows the relationship of the Bender to Reading Achievement, Number 
Achievement, the Human Figure Drawing Test, and to age as such, for the group as 
a whole. The results show that the Bender is somewhat more closely related to 
Number Achievement than to Reading Achievement, which confirms earlier findings 
), The relatively low although still significant correlation between the Bender and 
the Drawings confirms the assumption that the two tests measure primarily different 
factors and supplement each other. No significant relationship was found between 
age and the Bender; none of the Ss repeating the first grade were included in this last 
computation. 

Test results show that the Bender and the Drawings can predict achievement 
for groups of first grade students quite well. However, there were a few individual 
children who achieved considerably more or less than was anticipated on the basis of 
their Bender and Drawing scores. An analysis of these special cases shows some very 
young and very bright children among the over-achievers who seem to have been 
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able to catch up with their classmates in the course of the year although they were 
still quite immature at the beginning of school. Age by itself may not be important, 
but age together with intelligence is significant. Among the under-achievers we find 
older children who were repeating the first grade and who had more mature visual- 
motor coordination than their younger classmates, but limited intelligence made it 
difficult for these children to maintain their head start. On the other hand, some 
very aggressive, bright and verbal children with poor visual-motor coordination 
learned to compensate for their problem during the year and did fairly well academ- 
ically. Ss with speech and hearing problems did less well than was predicted on the 
basis of the non-verbal tests. These findings show the need for the addition of a short 
verbal test for even better prediction of school achievement. Also, further refinement 
of the scoring items currently employed is expected to increase prediction. 


SUMMARY 


This study tests the hypothesis that the Bender Gestalt Test and Human 
Figure Drawings administered at the beginning of the first grade can predict school 
achievement at the end of the year. The Ss were 143 children from six beginning first 
grade classes. All Ss were asked to draw ‘‘a whole person” in the classroom and were 
administered the Bender individually. The tests were scored according to the Kop- 
pitz system and were then correlated with scores from the Metropolitan Achievement 
Test, Primary I Battery, Form R which was administered seven months after the 
first two tests. The results show that the Bender and the Drawings both have the 
ability to predict achievement, but this power increases when they are used together 
as multiple predictors. 

On all three tests used in this study marked differences were found between the 
scores of the various classrooms. These differences seem to reflect in part socio- 
economic and cultural variables. Thus the results of intelligence tests given by the 
school indicate that more than half of the lower class children in our study have at 
least normal endowment, but only very few of these children showed even low 
average achievement. This may be largely due to immature perception and coordina- 
tion reflecting poor physical care, to the absence of motivation for learning, and the 
abundance of emotional problems as revealed in the Bender and Drawings. 

Middle class children were found to be under considerable pressure for achieve- 
ment which seems to result in good academic success but also in many emotional 
problems. They seem to be functioning up to the level of their capacity. Upper 
middle class children were found to have good intelligence and showed a balance of 
both pressure for achievement and emotional problems. Their test performances 
were outstanding and placed them about six months above the Average performance 
for the six Classes as a whole. 
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INDIVIDUAL DIFFERENCES IN RESPONSE VALIDITY! 
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PROBLEM 


A source of discouragement concerning personality tests is the low validity co- 
efficients generally obtained. As Cronbach and Gleser state, ‘Tests of leadership 
social and emotional adjustment, or creativeness rarely boast validity coefficients 
beyond .30.”’@: ». 3) It has been argued previously“? that personality items, even 
when they are objective questionnaire items, are particularly susceptible to the 
effects of the testing situation and irrelevant personal characteristics of the test- 
taker, since it was found that the tendency to acquiesce had a significant influence 
on the validity of a predictor of adjustment to flying training. The present study is 
an attempt to define another personality factor influencing the validity of measures 
of adjustment. 

The direction of investigation was influenced by Eysenck’s“) evidence that 
there are two primary independent dimensions of personality, a neuroticism factor 
reflecting degree of adjustment, and an introversion-extraversion factor reflecting 
type of adjustment. The introversion-extraversion dimension has also been called 
the psychasthenia-hysteria dimension, and this latter designation will be used in this 
article: The introvert, or psychasthenic, is described as neat, careful or accurate, 
tending to worry, and using intellectualization as a primary defense mechanism. 
The extravert, or hysteric, has been characterized as impulsive, careless or unaccur- 
ate, tends to avoid the expression of unpleasant thoughts or emotions, and uses 
denial as a primary defense. 

Possibly these characteristics have an influence on response tendencies in a 
testing situation. Perhaps the psychasthenic is more likely than the hysteric to admit 
negative feelings; perhaps in a situation where test scores could have an adverse 
effect on the subject’s career the hysteric might be more careless, and thus less con- 
sistent in his attempts to ‘fake good’’. 

Personality and performance tests, as well as criteria of adjustment, had been 
collected on a large number of Air Force flying personnel as part of a pilot selection 
research program. These data made possible an investigation of the hypothesis that 
personality traits are related to test validity. They also limited the scope of the 
study, in that, of the traits which have been said to define the hysteria-psychasthenia 
dimension, a measure of only one, the aspect of carefulness or accuracy, was derivable 
from the data. Therefore, the specific question raised in this study was whether 
degree of accuracy on a paper-and-pencil test (the McKinney Reporting Test) 
would be related to the validity of two tests predicting adjustment (the MMPI 
and Cornell Index). 


PROCEDURE 


Subjects. Two samples were used in this study. One was used to ascertain whether a relation- 
pac er between test validity and personality type, and the second was used to validate the 
relationship. 

Sample 1 consisted of 744 air cadets, in pilot training at Graham Air Base, Mariana, Florida 
in 1954-55. A large number of training level data were factor analyzed, and two factors which 
seemed to be related to personality and motivation variables were combined to form an Adapta- 
bility Index. The index represented a weighted composite of Buddy Ratings, Military Aptitude 
Ratings, Demerits, and Instructor Ratings, the weights being based on the factor analysis. The 
sample had been subdivided into three matched groups for the purposes of other research, and 


1Acknowledgment is due Dr. David K. Trites, Dr. Albert L. Kubala, and Mr. Bart Cobb, who 
developed the criteria and formed the matched groups in sample 1; Dr. John R. Barry, who developed 
the Combat Crew Training groups; and Dr. Saul B. Sells, who guided the research project of which 
this study is a part. 

*Now at Western Psychiatric Institute, University of Pittsburgh Medical School. 
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this division was maintained in this study. The subsamples (designated A, B, and C) were match- 
ed on (a) pass-fail ratio, (b) class- an equal number from each training class was in each sub- 
sample, (c) age, (d) Adaptability Index, (e) failure cause, based on the findings of the Faculty 
Board which determined whether the trainee should be washed out, (f) Cornell Index score—the 
distribution of scores was roughly equivalent in each subsample. 

Sample 2 contained three groups, each consisting of trainees who were at the extremes on the 
adjustment criterion. These groups were constructed as follows: 

(a) GAB Cadets. This group was selected from 472 aviation cadets in flight training 
at Gr Air Base in 1953-54. An estimate of level of adjustment was made on each cadet 
by a psychologist using interview material, training outcome, grades, medical records, and 
instructor ratings of officer potential. The method of combining these data to obtain the 
estimate of adjustment has been described elsewhere“). On the basis of the estimates, 100 
who appeared well adjusted, and 100 who seemed poorly adjusted were selected from the 
total 472 cadets, to form a high-low adjustment group. 

(b) CCT Pilots. This group was selected from 428 co-pilots enrolled in Combat Crew 
Training at Randolph Air Force Base in 1951-52. Ratings of adjustment were made by a 
clinical psycholegist based on interview material, projective tests, training records, medical 
records, and sociometric peer ratings on leadership, confidence, and pons @), Ninety- 
six co-pilots with high ratings of adjustment, and 92 with low ratings were selected to make 
up the high-low adjustment group. 

(c) CCT A/Cs. This group was selected from 338 aircraft commanders who were en- 
rolled in the same Combat Crew Training course as the CCT Pilot group. The ratings of 
adjustment were estimated in the same manner as for the CCT Pilots. Ninety-seven air- 
craft commanders with high, and 93 with low ratings of adjustment comprised the high-low 
adjustment group. 

Design. The ene Reporting Test was used to classify the subjects along the hysteria- 
psychasthenia dimension. This is a paper-and-pencil test requiring the counting of small details 
in an accurate and speedy manner. Midway in the test a stress period is introduced involving the 
threat of failure. Another study using this test“) indicated that the difference between prestress 
accuracy and accuracy during stress was related to the hysteria-psychasthenia personality di- 
mension. The accuracy score (per cent attempted which were correct) for the prestress and stress 
periods were T-scaled and the difference between these T-scores was computed. The difference 
score was the basis for dividing the Ss into three categories: those who shifted toward accuracy 
(toward the psychasthenic end), those who did not shift at all, and those who shifted toward 
inaccuracy (toward the hysteric end), going from prestress to stress. Since there was a general 
tendency to become inaccurate under stress, in terms of the raw accuracy scores, the categories 
represent relative rather than absolute shifts. 

In sample 1, the Cornell Index“) was used as the predictor of the adjustment criterion. In 
sample 2 an Adjustment Key on the MMPI“) was the predictor. Both of these tests are person- 
ality questionnaires which have shown promise in earlier phases of the previously mentioned 
pilot selection research program. 

For each of the categories defined by the difference score on the McKinney Reporting Test, 
the predictor test was correlated with the adjustment criterion, and it was hypothesized that the 
correlations would vary systematically as a function of the difference score. The hypothesis was 
not stated any more exactly, since plausible arguments could be made for either extreme on the 
accuracy dimension giving the most valid answers. 


RESULTS 

On the basis of the difference score computed on the McKinney Reporting Test, 
the Ss in Sample 1 were divided as nearly as possible into three categories with equal 
Ns, representing those who shifted toward accuracy, did not shift, and shifted toward 
inaccuracy, going from prestress to stress. Within each of the subsamples the Cornell 
Index scores were correlated with the adjustment criterion, separately for each shift 
category. The results are shown in Table 1. Overall, the correlation was highest for 
those who shifted toward accuracy, and lowest for those who shifted toward in- 
accuracy. The total correlations for these categories differed significantly from each 
other at the .05 level, when tested by a formula from McNemar“!: ». “8), Neither 
extreme category differed significantly from the no shift category. There was marked 
fluctuation from subsample to subsample, indicating a rather low level of consistency 
in the relationship. The variance of the Cornell Index scores for the shift to in- 
accuracy category was significantly greater than the variances of the other two 
categories, indicating that restriction of range was not a factor in the correlational 
differences. The variances of the Adaptability Index did not differ from category to 
category. 
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Tas_e 1. Pearson Propuct-MomMENT CoRRELATIONS BETWEEN THE CORNELL INDEX AND 
ADAPTABILITY INDEX AS A FUNCTION OF SHIFT IN ACCURACY ON THE McKinney Reportine Test 








Shift to accuracy No shift Shift to inaccuracy 
aaa | as 


Cornell Index Cornell Index 
GAB Subsamples r 8.D. N r 8.D. 
A -.47** . -.05 : -.24* 
B . * -.37** 32 -.06 
Cc : F -.10 2. -.03 
TOTAL . J -.16* : -.10 


r 

















*Significantly different from zero at the .05 level. 
**Significantly different from zero at the .01 level. 


On the basis of the findings from Sample 1, a specific hypothesis was formulated 
to be tested in Sample 2 that validity of scores on the MMPI Adjustment Key would 
be highest for those Ss who shifted toward accuracy on the McKinney Reporting 
Test, and least for those who shifted to inaccuracy. In an attempt to provide a more 
sensitive test of the differences between the extreme shift categories, the Ss in Sample 
2 were divided at the first and third quartiles of the distribution of McKinney 
difference scores, with one-fourth of the Ss being placed in the shift to accuracy 
category, one-fourth in the shift to inaccuracy category, and one-half in the no shift 
category. Since the criterion of adjustment in Sample 2 was a high-low dichotomy, 
point biserial correlation coefficients were computed to measure the validity of the 
MMPI scale, and the t-test, as described in Walker and Lev “*: »- 2) was used to 
estimate the significance of the correlation. The results are shown in Table 2. The 
differences in correlations support the hypothesis. 

Tas.e 2. Pornr BiserRtAL CoRRELATIONS BETWEEN MMPI Apsustment Key AnD THE Hiau-Low 


ADJUSTMENT CRITERION, AS A Function oF Suirt in Accuracy ON THE McKINNEY 
ReportinG Test 








Shift to accuracy No shift Shift to inaccuracy 
MMPI MMPI MMPI 
Samples Tpb 8.D. N Tpb 8.D. N Tpb 8.D. 


| -.39* 12.90 49 | -.36%* 10.88 102 | -.14 16.38 
CCT Pilots | =e 12.74 34 | -.32 11.50 113 | -.06 9.31 
CCT A/Cs | -.55** 10.21 36 | -.25°* 11.05 108 | -.31* 10.61 
TOTAL | -.29°* 12.09 119 | -.27*" 11.20 323 | -.14 12.73 





GAB Cadets 











*Significantly different from zero at the .05 level. 
**Significantly different from zero at the .01 level. 


The significance of the differences was tested by a method described by Tate 
(12, p. 1086), None of the correlations differed significantly at the .05 level; however, 
those in the shift to accuracy and no shift categories differed from the correlation 
for the shift to inaccuracy category at less than the .10 level (one-tailed test). In 
view of the fact that five out of six groups gave results in the same direction, and 
these results were significant for sample 1, it is concluded that a consistent, though 
slight, relationship existed in these samples between accuracy and response validity. 


DIscussIon 

The data support our theoretical position that the extraversion-introversion, 
or hysteric-psychasthenic, dimension is a variable related to response validity on 
personality questionnaires. But it is possible this relationship is due to biases in the 
research design. There appear to be three main sources of bias in any predictive 
study: selection of criteria, selection of predictors, and selection of samples. 

Did the criteria in this study—all based on ratings of peers or psychologists— 
favor the valid identification of adjustment level among psychasthenics? The 





172 SAMUEL C, FULKERSON 


hysteric typically tends to deny symptomatology; the facade presented to the 
world seems to be that of a nonchalant, socially outgoing person whose troubles 
spring from outside himself, rather than from his own inadequacies. This appear- 
ance matches a common conception of the “good pilot’’, and possibly this facade 
“fooled” both the clinicians and fellow trainees who made the ratings on which the 
adjustment criteria were based. However, if this were so, one would expect fewer 
ratings of low adjustment given to those who, on the McKinney Reporting Test, 
shifted toward inaccuracy. The data do not support this deduction. The mean 
Adaptability Index for all of those in sample 1 who shifted toward accuracy was 51.15; 
for those who shifted toward inaccuracy it was 49.40. These differences are not 
significant, and are in the reverse of the deduced direction. Furthermore, the 
variances of the Adaptability Index scores are equivalent for all groups. There does 
not seem to have been any bias in the criteria related to the hysteria-psychasthenia 
dimension. 

Did the predictor measures contribute to the differences in response validity? 
The Cornell Index and MMPI items dealt mainly with symptoms related to emo- 
tional disturbance. These may be the kind of items which primarily describe the 
symptoms of the maladjusted psychasthenic. If so, the test scores should be lower 
for those who shifted toward inaccuracy than for those who shifted toward accuracy. 
This deduction was not borne out by the data. In sample 1, for instance, the mean 
Cornell Index scores for the shift to accuracy and shift to inaccuracy groups, res- 
pectively, were 2.21 and 2.62, which were not significantly different. 

Did the selection of the samples bias the relationship? The groups used in the 
study were composed by different methods, by different researchers, over a three 
year period of time, and based on different criteria. There would seem to be no clear- 
cut reason to suspect the kind of bias in selection that would cause the relationships 
found in this study. It appears, then, that some combination of characteristics re- 
lated to the hysteria-psychasthenia dimension influenced the results. The hysteric 
group was defined in terms of a shift toward more careless, inaccurate performance 
on the McKinney test; possibly the lower validity coefficients obtained in this group 
were due to the same carelessness, 7.e., that the hysteric tends to be inaccurate in his 
reading of the questionnaire items, and impulsive in his answering, thus introducing a 
greater measure of error variance. Tables 1 and 2 indicate a general tendency for the 
variance of both the MMPI and Cornell Index scores to be greatest for those who 
shift toward inaccuracy, as would be expected if the error term were greatest in this 
group. It is therefore suggested that the results are due to differential tendencies to 
be careless in hysterics and psychasthenics. 

The present study represents the application of a method which has not re- 
ceived much attention in personality testing, one aimed at the identification of 
error. Ghiselli®: ') has used this approach in an industrial selection problem. If a 
scatter diagram is envisioned, representing the correlation between a predictor test 
(X) and a criterion (Y), it can be seen that some points lie nearer the regression line 
of Y on X than others. The difference between the regression line and the point, 
irrespective of direction, reflects the error of estimate. If a third variable existed 
which correlated with the (Y-Y’) residual error term, it could be used to identify 
those which lie near the regression line. While the present study is not a strict applica- 
tion of this method, the logic is similar. The McKinney accuracy score presumably 
could not correlate with the validity of the questionnaires if it did not correlate with 
the (Y-Y’) term, since differences in validity directly reflect differences in (Y-Y’). 

The implication of this model is that one does not make a decision for all cases, 
but only for those for whom the test is valid. While this procedure has not received 
much attention from researchers, in an informal way it has been used by working 
clinicians, and has been advanced as an argument for why negative findings—e.g., 
on the Rorschach—should not be assumed to invalidate the test. The clinician may 
say, for instance, that he feels the experience ratio is not really a true reflection of the 
inner resources of a given patient; and he will dismiss research showing that there is 
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no correlation between the experience ratio and inner resources by saying that the 
study does not “take enough into account’’. One implication of this criticism is that 
patterns of response, rather than single responses, correlate with inner resources. 
Another is that there are only certain people for whom the relationship holds. This 
study suggests that this latter possibility should receive greater attention than has 
been afforded it in the past. 


SUMMARY 


Two samples of Air Force trainees were selected for which ratings of adjustment 
had been made. A personality questionnaire (the Cornell Index or the MMPI), 
and a counting task involving speed and accuracy (the McKinney Reporting Test), 
had been administered to each trainee. 

It was hypothesized that the correlations of the personality test with the ad- 
justment criterion would vary as a function of an accuracy score on the McKinney 
Reporting Test. The accuracy measure was assumed to reflect status on a hysteria- 
psychasthenia, or extrovert-introvert dimension. 

Consistently lower validities for the personality tests were found within the 
group which was at the hysteric end of the personality dimension. Differences in 
carefulness of responding seemed to be an important variable determining this re- 
lationship. 
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THE SIXTEEN PERSONALITY FACTOR TEST IN 
CLINICAL PRACTICE 


SAMUEL KARSON!?:? 
Dade County Child Guidance Clinic, Miami, Florida 


PROBLEM 


Although the Sixteen Personality Factor Questionnaire has an impressive re- 
search background and theoretical foundation, as yet there is little actual empirical 
evidence of its usefulness with psychiatric patients“: ?). The purpose of this paper 
is to present some statistical information bearing on its validity with practical 
criteria which may help gain more acceptance for this test in clinical practice“ ®). 
Specifically, this research attempted to determine whether the 16 P. F. test could 
successfully discriminate two samples of psychiatric patients from a sample of 
“normals”’ as well as discriminating the two patient samples from one another even 
though one of the patient groups was self-motivated to be revealing while the other 
was cautious and guarded in their approach to the test situation. 


PROCEDURE 


All of the subjects included in the study were U. 8. Air Force enlisted men 
stationed in the Washington, D.C. area. Statistical comparisons showed no signi- 
ficant difierences among any of the three groups on such variables as age, educational 
achievement and IQ. All Ss were individually administered Form A of the 16 P. F. 
test by the staff psychologist. 

Group I consisted of 199 Caucasian, native born, enlisted men of the U.S. Air Force. All of 
the subjects had at least one year of productive service in the Air Force with no known records of 
court martial or psychiatric history. Service records were consulted and the opinions of first 
sergeants and commanding officers were obtained in order to insure that the sample of “normals” 
actually were well adjusted to Air Force life. This sample had a mean age in years of 23.2 with a 
standard deviation of 2.8. Their mean educational attainment was 11.2 with a standard deviation 
of 1.75. They had a mean score on the Armed Forces Qualifications Test at the 55th percentile. 


Group II consisted of 48 U. 8. Air Force enlisted men, all of whom had been diagnosed as 
anxiety neurotic by the same board certified psychiatrist. All of these patients were later placed 
on a waiting list for group psychotherapy. Almost all were self-referrals. In interview they tended 
to be anxious, dependent and revealing. 


Group III was composed of 37 U. 8. Air Force males who were referred ‘to the Psychiatric 
Service from other departments of the hospital. These cases consisted of psychosomatic and 
general medical problems in whom psychiatric involvement was suspected by the referring 
physician. In interview these patients tended to be aloof, hostile and obstructive. 


RESULTS AND Discussion 

Table 1 presents the raw score means and standard deviations of each of the 
three groups for all of the sixteen personality factors. Using a two-tailed test, 
critical rations were computed in order to obtain the results given in Table 1. 
With regard to the comparisons between ‘‘normals” and anxiety neurotics, twelve 
factors on the 16 P. F. test differentiated the two groups. Four of the five major 
factors loading the second-order questionnaire factor of Anxiety-vs.-Dynamic Inte- 
gration reached the .01 level of confidence, namely, C (Emotional Stability-vs.- 
Dissatisfied Emotionality), O (Guilt Proneness-vs.-Confident Adequacy), Q* (High 
Ergic Tension-vs.-Low Ergic Tension), and L (Protension-vs.-Relaxed Security). 
Factor Q* (High Self-Sentiment Formation-vs.-Poor Self-Sentiment Formation) 
was significant at the .05 level. The findings are in direct agreement with the concept 
validity of these particular factors and strongly support the interpretation given 
them in the test manual: ??. 


1The data for this study were collected while the author was stationed at the 1100th U.S. Air Force 
Hospital in Washington, D. C. 

*Grateful acknowledgement is expressed to Captain James Moyer, U. 8. Air Force, who made 
available the data on the well-adjusted Air Force group. 
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*With df = 245, or 234, t must = 1.97 or 2.60 for significance at the .05 and .01 levels respectively, 
using a two tailed test; with df = 83, t must = 1.99 or 2.63. 

a Raw score mean of 199 well adjusted USAF males. 

b Raw score mean of 48 USAF male psychiatric outpatients prior to group therapy. 

c Raw score mean of 37 USAF males referred to the Psychiatric Service. 





Other factors which discriminated the two groups at the .01 level included F 
(Surgency-vs.-Desurgency), H (Parmia-vs.-Threctia), I (Premsia-vs.-Harria) and M 
(Autia-vs.-Praxernia). Here again all the obtained differences are in the direction 
expected on the basis of the interpretation given these factors in the test manual. The 
neurotics are found to be significantly more depressed, more timid and shy, more 
emotionally sensitive and more bohemian and unconcerned. 

Factor G* (High Group Conformity-vs.-Low Group Conformity) and Q? (Self- 
Sufficiency-vs.-Group Dependency) were significant at the .05 level of confidence. 
It is of interest to note that in military life, low group conformity and low group 
dependency apparently do not contribute to successful adjustment. Factor Q! 
(Radicalism-vs.-Conservatism) also significantly differentiated the two groups. 
Here again the higher mean score of the patient group on this factor can readily be 
encompassed within the framework of Air Force life. With the exception of Factor 
A, all of the factors loading the second-order questionnaire factor of Extraversion- 
vs.-Introversion successfully discriminated the two groups. In short, the patient 
to is seen as highly anxious and somewhat less highly introversive on the 16 
P.F. test. 

When the group of “somatizers’”” was compared with the “normals”, seven 
factors successfully discriminated the two groups, namely, A (Cyclothymia-vs.- 
Schizothymia), B (General Intelligence-vs.-Mental Defect), C, I, O, Q? and Q*. The 
patients are seen as less warm and sociable, more intelligent, less stable, more emo- 
tionally sensitive, suffering more guilt feelings, being more self-sufficient and of 
higher ergic tension than the well-adjusted group. 

It should be reiterated that the motivation of these patients at the time of 
testing very likely served to influence their test responses. It is probably a tribute 
to the selection of items on the 16 P.F. test that despite the motivational distortions 
on the part of these patients, factors C, O and Q‘ still succeeded in discriminating 


’Experience with factor G has necessitated its reinterpretation as High Conformity to Group 
Standards-vs.-Low Conformity in preference to its identification by Cattell@ °- 1) as Super-Ego 
Strength-vs.-Lack of Internal Standards. This distinction becomes necessary in order to separate 
those who act out their conflicts from non-actor-outers even though both may score low on G, i.e., 
psychiatrists and psychologists tend to score uniformly low on this factor. 
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this group from the “normals’’. That the test is probably fakeable to some degree 
may be gathered from the fact that the anxiety-neurotics who were motivated to 
reveal themselves were more easily discriminated from the “‘normals’’ than were the 
“somatizers”’. It is also notable that although certain raw score mean differences 
between the ‘“‘normals’’ and “‘somatizers’’ did not reach the .05 level of confidence 
using a two-tailed test, the mean differences on factors F, G, H and L are all in the 
expected direction. 

When the two patient groups were compared, three significant mean differences 
were obtained on factors F, M and Q'. The anxiety-neurotics were more depressed, 
tended toward greater bohemian unconcern and were more radical than the ‘‘soma- 
tizers’’. Had this latter group been more successful in its efforts to distort their test 
responses, significant differences on certain test factors might well have been ex- 
pected to emerge between the two patient groups. The fact that such differences 
were not obtained can be interpreted as indicating that the test items cannot easily 
be faked by psychiatric patients who were apparently motivated to be non-revealing. 


SUMMARY AND CONCLUSIONS 


In order to test certain aspects of the concept validity of the Sixteen Personality 
Factor Questionnaire, two groups of U. 8. Air Force male psychiatric patients were 
compared with a well-adjusted group of Air Force enlisted men and with each other. 
There were no significant differences among the three groups with respect to age, ed- 
ucational achievement or IQ. One patient group, consisting of 48 anxiety-neurotics, 
was self-motivated to be honest and revealing in its responses to the test. The other 
patient group, composed of 37 general medical and psychosomatic cases, was self- 
motivated to be cautious and guarded in its test responses. The response set of the 
“normals’”’ probably varied between these two extremes, but on the whole they ap- 
peared to be honest and frank in their test responses since it was made clear to them 
that the test results would in no way effect their Air Force careers. Using a two- 
tailed test, critical ratios were computed on the raw score test means of the three 
groups. The 16 P. F. test discriminated the “normals” from the anxiety-neurotics on 
twelve of the sixteen personality factors measured by the test, including all of the 
factors which load the second-order questionnaire factor of Anxiety-vs.-Dynamic 
Integration. The test also discriminated the ‘‘normals’’ from the ‘‘somatizers” on 
seven personality factors. When the two patient groups were compared, significant 
differences emerged on three factors. These results strongly support the concept 
validity of certain factors on the 16 P. F. test. The findings also suggest that this 
test is not easily subject to motivational distortion by psychiatric patients inclined 
to be resistive and hostile. 
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CONSTRUCT VALIDITY OF THE CHICAGO Q-SORT: 
FRUSTRATION TOLERANCE! 


ERNEST J. DOLEYS AND JOHN KREGARMAN 
Marquette University University of Texas 


PROBLEM 


The Chicago Q-sort has been used as a measure of self-acceptance in studies of 
counseling process and outcome. Butler and Haigh“? point out that the relationship 
between an individual’s sort for the self-concept and his sort for the ideal self-concept 
should logically measure that person’s degree of self-acceptance. Except for the 
logical approach to the validity of this use of the Chicago Q-sort, however, no in- 
formation exists regarding the meaning of scores obtained by the use of this instru- 
ment. In an earlier paper, Doleys® presented a derivation from self-theory with 
supporting evidence that self-acceptance is positively related to frustration toler- 
ance. A portion of an investigation by Solley and Stagner® further supports this 
derivation. 

Frustration tolerance might be used as a criterion for an examination of the 
construct validity of the Chicago Q-sort. If this Q-sort is validly measuring self- 
acceptance, then scores on it should be significantly related to frustration tolerance. 
Frustration tolerance is often defined in terms of the amount of learning which oc- 
curs under stress“*’. It is possible to define at least two types of stress: (a) personal 
stress, in which the Ss are ego-involved in the task so that their experience of failure 
is potentially upsetting or emotional; (b) cognitive stress, in which ego-involvement 
is minimized in the Ss, and presumably their experience of failure should be less 
emotional. Both Doleys®? and Solley and Stagner®? used cognitive stress situa- 
tions, although within the framework of self-theory “?, both type of stress and degree 
of self-acceptance would affect the frustration tolerance of individuals, since emo- 
tional and cognitive factors in the organism are clearly distinguished in the theory. 

This investigation tested the following specific hypotheses: I. There is a signi- 
ficant relationship between self-ideal similarity on the Chicago Q-sort and frustration 
tolerance. II. Frustration tolerance varies significantly as a function of the combined 
effects of level of self-acceptance and type of frustration. 


METHOD 


Subjects. Three hundred students were selected systematically from the University 
of Texas student directory to obtain a representative sample of the undergraduate 
population. Each of these was sent a letter requesting him to participate in the in- 
vestigation and offering a stipend of two dollars if the individual completed the 
experiment. Seventy of this group appeared for testing, and of these, three were not 
available for the experimental portion of the study. This left a final sample of 67 Ss. 


Procedure. All Ss were given the 74 items of the Chicago Q-sort which had been found 
to differentiate between good and poor adjustment“, sorting these into an eleven 
pile forced normal distribution. One sort was obtained for self-concept, and one was 
obtained for ideal self-concept. 

Q-technique correlations®) were obtained for each S’s sorts. These values were 
then transformed into Fisher’s z-statistic in order to normalize the distribution). 
Ss were then assigned to five levels of self-ideal correlation, ranging from very low to 
very high. Ss within each level were assigned randomly to one of two experimental 
conditions (personal or cognitive). 

The experimental task consisted of making correct English words from scram- 
bled ones. Each S was given two solvable words as a warmup. Following this, each 
S was given five solvable words as a pretest, three unsolvable words as the frustration 


1This investigation was undertaken with the financial assistance of the Hogg Foundation for 
Mental Hygiene while the senior author was employed by the University of Texas. 
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condition, and five solvable words as the posttest. Ss believed all the unsolvable 
words to be solvable. 

In the cognitive frustration task, Ss were merely stopped after 120 seconds with 
the unsolvable words, and the next word in sequence was presented. Thus the failure 
experience for these Ss was merely that of being interrupted after a period of time. 

In the personal frustration task, E attempted first to establish a friendly rela- 
tionship with each S. He than proceeded with the warm-up words and the pretest 
as in the cognitive task. After 120 seconds with the first unsolvable word, E stated, 
in a puzzled tone, ‘“You missed that one, didn’t you?”’ After S had spent 120 seconds 
with the second unsolvable word, EF asked, with pronounced sarcasm, ‘“‘You’re having 
a real tough time with these, aren’t you?” After S spent 90 seconds with the third 
unsolvable word, E began to tap his pencil nervously. After 120 seconds, FE took 
away the card, commenting resignedly, ‘Oh, well,” in such a way as to imply that 
he considers S an idiot and is no longer concerned with S’s further performance. The 
posttest was then carried out in the same fashion as in the cognitive task. 


, 


REsULTsS AND Discussion 

Lindquist’s treatment X levels design, with five levels of self-acceptance and 
two treatment conditions (personal and cognitive), was used in this investigation. 
The dependent variable, amount of improvement following frustration, was estim- 
ated by subtracting the total time to complete the posttest from the total time to 
complete the pretest. This value was corrected for pretest ability by analysis of 
covariance. Lindquist’s procedure for treating missing cases was used for the three 
persons who took the Q-sorts but did not return for the experiment itself‘: »- '*). 
A significant F-test for treatments was found. This result would be expected on the 
basis of most earlier work on frustration and verbal learning“. A significant inter- 
action effect was not found; therefore, Hypothesis II was not verified. 

Since a sigrificant difference between the two treatments was found, it seemed 
necessary to study the relationship between self-acceptance and frustration toler- 
ance within each of the types of treatment. For each group, therefore, a Pearson r 
and an eta coefficient were computed between the obtained Q-values (transformed to 
Fisher’s z) and the frustration tolerance score. In neither group was the relationship 
a significant linear one. The eta coefficient was found to be significant for the cogni- 
tive frustration task, but the shape of the relationship was extremely irregular and 
not psychologically meaningful. It can be concluded that Hypothesis I was not con- 
firmed. 

In general, the results of this investigation regarding the construct validity of 
the Chicago Q-sort were inconclusive, and inconsistent with those obtained earlier by 
Solley and Stagner® and by Doleys®?. 

The lack of positive findings may reflect the possibility that the Chicago Q-sort 
does not validly measure the frustration tolerance component of self-acceptance. 
Unfortunately, drawing this conclusion would necessitate acceptance of the null 
hypothesis. On the other hand, the previously mentioned studies “: 7: *) found positive 
results using different measures of self-acceptance. It would appear that further 
research on the validity of this Q-sort might well be undertaken. If the results con- 
tinue to be negative, the probability of committing a Type II error would be lessened, 
and the conclusion of invalidity of the instrument could be made more safely. 


SUMMARY 

This study investigated the construct validity of the Chicago Q-sort. The 
criterion for validating the instrument was frustration tolerance, obtained from a 
personal stress situation and from a cognitive stress situation. Ss, after being assigned 
to levels on the basis of the Q-sort correlation, were randomly placed in one of the 
two treatment conditions. The frustration tolerance score consisted of the amount 
of learning which occurred between pre-stress and post-stress trials using scrambled 
words. The results were inconclusive, but tended to imply that the Chicago Q-sort 
does not validly measure the frustration tolerance aspect of, self-acceptance. 
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A VALIDITY STUDY OF THE SEMANTIC DIFFERENTIAL TECHNIQUE 


AUSTIN E. GRIGG 
University of Texas 


INTRODUCTION 


Osgood’s semantic differential technique ®: * * is a relatively brief and computa- 
tionally simple method for the measurement of the meaning that a client assigns to 
the “self”? or “ideal self’, and provides for measurement of the discrepancy be- 
tween these concepts. The assumptions are made that “the process of description or 
judgment can be conceived as the allocation of a concept to an experiential contin- 
uum defined by a pair of polar terms. . . (and) a limited number of such continua can 
be used to define a semantic space within which the meaning of any concept can be 
specified.’’°. P. 3) Analysis reveals three recurring factors: (a) an evaluative factor, 
represented by such polar terms as ‘‘good - bad’’ or “sweet - bitter’, (b) a potency 
factor, represented by such polar terms as “‘strong - weak” or “‘loud - soft”, and (c) 
an activity factor, represented by such polar terms as ‘‘fast - slow” or “tense - re- 
laxed’’. Subjects are given several polar terms for each of the three factors and are 
asked to assign a given concept (7.e., “my self’’) on a seven point scale for each polar 
term. When subjects go through this process for a number of different concepts, it is 
possible to determine the distance between the assigned values for these concepts. 

The present study tested whether the semantic differential scores of a group of 
normal subjects reflect greater distance between ‘“‘ideal self’’ and “neurotic” than 
between ‘‘self” and ‘‘neurotic’’, and whether semantic differential scores obtained 
when judging an actual case would shift in a predicted direction as a result of experi- 
mental manipulation of the bases of judgment. 


METHOD 

Forty-two university undergraduates who had completed the portion of a 
course in abnormal psychology dealing with psychoneurosis were asked to use the 
semantic differential for their “‘self’’, “ideal self’ and for ‘neurotic’. Two days later, 
all subjects read a carefully edited selection about Miss X that was highly favorable 
to her. Then, the subjects completed the semantic differential scales on Miss X. 
Immediately at the conclusion of this, thirty of the subjects received a second 
selection about Miss X that indicated that Miss X was a neurotic who had sought 
help at a clinic and the selection stressed the concept of neurosis without supplying 
any additional behavioral or social data. Twelve control subjects were given a sheet 
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with instructions to wait quietly while others were reading. Then the experimental 
and control groups completed the semantic differential scales on their impression 
“now” of Miss X. The prediction was made that the experimental subjects’ second 
rating of Miss X would be closer to their meaning for neurotic than their first rating 
of Miss X, and that the control group would show no significant change. 


REsvULtTs 


Table 1 gives the means and standard deviations for the distance scores between 
self-vs.-neurotic and ideal self-vs.-neurotic. The ¢ test for difference between cor- 
related means was employed and was significant at .001 level for a one-tailed test 
(t = 3.546). This supports the semantic differential since the results are in the ex- 
pected direction, namely, that normals place their ideal self significantly further from 
their concept of neurotic than they place their concept of self. 


Tas 1. Distance Between Setr, Ipeat-Set¥F anp NEvrRoTICc 
N Mean 8D t p 
Self vs. Neurotic 42 3.19 1.22 
Ideal vs. Neurotic 42 4.43 1.25 








3.55 .001 





In the second phase of the study, the experimental group placed Miss X closer 
to neurotic after the second reading than after the first and favorable reading. The 
control group did not move significantly closer to the neurotic meaning at the second 
scoring, although Miss X was shifted slightly toward the neurotic locus. These re- 
sults are shown in Table 2. 


TaBLe 2. Distance BETWEEN NEUROTIC AND Miss X 





1st judgment 2nd judgment 
N Mean SD Mean SD t 


Experimental 30 3.63 1.45 3.45 1.27 2.090* 
Control 12 3.62 1.43 3.57 1.47 0.728 


*Significant at .025 level. 


Although the ¢ test is significant for the experimental group and not for the con- 
trol group, what is needed is a testing of the differences between the two groups. 
Inspection of the data indicated a high correlation between initial and second meas- 
ures (by actual computation, .958). It was decided, therefore, to employ McNem- 
ar’s®) analysis of covariance for difference scores. This provides a method for cor- 
recting second testing means for individual differences in first testing scores with 
allowance for the correlation between the first and second testing scores and is more 
sensitive than simple comparison of differences between the experimental and control 
groups. By the analysis of covariance, the corrected differences between the experi- 
mental group and the control group are not statistically significant. 








SUMMARY 


Using the semantic differential technique, a group of normal subjects indicated 
significantly greater distance between “ideal self’ and “neurotic” than between 
“self” and “‘neurotic’’, a result favorable to the validity of the semantic differential. 
Later, the group was divided into an experimental and a control group and both 
groups read the same favorable selection about Miss X and then rated her on the 
semantic differential. Then, the experimental group read a statement which stressed 
that Miss X was not as well adjusted as appeared on the surface, and the label 
‘neurotic’? was emphasized in the statement. The control group received no addi- 
tional data on Miss X. Both groups were asked again to indicate their impressions 
of Miss X by the semantic differential. Although the experimental group shifted in 
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the expected direction, an analysis of covariance indicated that differences between 
the changes in the two groups were not statistically significant. 
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ANAGRAM-SOLVING ABILITY AS AN INDEX OF ANXIETY 
ROBERT B. TALLARICO AND E. EDWARD REITMAN 
University of Miami 


Recently, Wiggins“: *) reported a significant relation between multiple solution 
anagram-solving ability and anxiety as measured by the Taylor Manifest Anxiety 
Scale. To a group of 68 night school psychology students he first administered the 
anagram test and then the MAS which may have been influential in determining the 
obtained significant negative r. However, no control group was used to determine 
the effect of the order of test administration on the correlation. The purposes of the 
present study were to validate the anagram test as an index of anxiety on a larger 
student group, and to introduce a control group to note the effect, if any, of order of 
test presentation. 


PROCEDURE 


The subjects in this study were 176 college students in introductory psychology 
courses tested during the summer session. These students were randomly divided 
into two groups, A and B. Group A (N = 91) was first administered the anagram- 
solving test and then the MAS in accordance with Wiggins®); group B (N = 85) 
was administered the same two tests in an inverse order. 


RESULTS AND DIscussION 

Pearson r’s were obtained between anagram and anxiety scores. For group A, 
the r was —.102; in group B, the r was .013. These correlations were not significant 
at the .05 level. Ignoring the order of test administration, the r between all the 
scores combined was —.033, not significant. These results do not support Wiggins’ 
findings that anagram-solving ability is an index of anxiety. Moreover, they reveal 
that order of test presentation has no effect on the correlation. Therefore, if the 
anagram-solving test is to be used as a rapid clinical diagnostic tool (Wiggins, 1956), 
there would still seem to be the need to determine what it measures. 


SUMMARY 


Wiggins (1957) reported a significant relationship between anagram-solving 
ability and anxiety as measured by the MAS. The purpose of this study was to vali- 
date the anagram test on a different sample and to determine the effect of order of 
test administration upon the correlation. Using 176 summer school psychology 
students, the results showed no significant correlation between the two tests, and the 
correlations were not influenced by the order of the test administration. 
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THE RELATIONSHIP OF SEX AND INTELLIGENCE TO CHOICE OF 
WORDS: A NORMATIVE STUDY OF VERBAL BEHAVIOR* 


GOLDINE C. GLESER, LOUIS A. GOTTSCHALK, AND WATKINS JOHN 
University of Cincinnati, College of Medicine 


PROBLEM 


The objective analysis of verbal behavior is becoming an important experi- 
mental means of assessing changes in psychotherapy, demonstrating various psycho- 
somatic relationships, and exploring directly the symbolic processes of man inherent 
in his language and his language usage.“ * *) For many such studies it is important 
to know whether choice of word-type varies with sex and intelligence. To this end 
we have investigated the relationship of these factors to the types of words used by 
an occupationally adjusted and medically healthy population. The specific purposes 
of this study were: (a) to determine the relation of sex and intelligence to selected 
speech variables, (b) to obtain norms for these speech variables (considering sex and 
IQ differences wherever pertinent), and (c) to determine the reliability of the speech 
variables as measures of individual differences and their interrelations. 


METHOD 


A normative sample of verbal behavior under standardized conditions was ob- 
tained on 90 Caucasian subjects, between the ages of 20 and 50, who were gainfully 
employed in one place for at least six months with a satisfactory work record and no 
evidence of disabling psychiatric or medical problems. The sample was stratified on 
the basis of sex and intelligence using three levels corresponding roughly to dull 
normal, high normal, and superior IQ. The majority of the subjects were employees 
at the Manufacturing Division of the Kroger Company in Cincinnati, Ohio.' Since 
it was originally intended to obtain all subjects from the Kroger plant where the 
currently used intelligence test is the Wonderlic (Form A or B), this test was used 
as the basis for identifying level of intelligence. The Wonderlic score was adjusted 
for age® ».*®), and the equivalent IQ obtained from Kroger normative data. The 
dull normal group had IQ’s ranging from 80 through 100 by this test (corrected raw 
score 10 through 19) ; bright normal, 101 through 115 (corrected raw score 20 through 
29); and the superior, 116 and above (corrected raw score 30 and above). 

It was planned to select 15 individuals of each sex from each IQ level who satis- 
fied the criteria of age, work adjustment and medical health indicated above. The 
subjects were selected on the basis of personnel records providing information on 
educational level, IQ (Wonderlic test), work adjustment, personality test findings, 
medical health records, etc. When an employee satisfied all criteria for selection, 
he was asked to participate in the study by giving a five-minute verbal sample. Un- 
fortunately, it was impossible to fill our quota of women of superior intelligence from 
workers at the Kroger plant, so this group was supplemented with women working 
at the Cincinnati General Hospital in various capacities and with wives of medical 
school faculty members. In each case the Wonderlic test was administered and the 
score used as a criterion for inclusion. Table 1 presents the pertinent biographical 
statistics for the 90 subjects showing diverse occupational classifications. There is a 
close relationship between IQ and education in the sample. The product-moment 
correlation between them is .72. Thus differences in verbalization among the IQ 
groups may reflect difference in educational background. 


*This study was supported in part by a research grant (M-1055) from the National Institute of 
Mental Health, National Institutes of Health, U. S. Public Health Service, Department of Health 
Education and Welfare. 

1We are indebted to the Kroger Company for giving their employees’ time and opening their 
personnel files to expedite this study; without this cooperation from a major local industry with our 
research team, this investigation would have been seriously hampered. 
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TasLe 1. BroGrapaicaL Data oN Normative SAMPLE. 








Means 
-. Education 
(Wonderlic) (years) 


110.6 12.3 
108.2 


92.2 
109.7 


Z 





Males 
Females 


Dull Normal 
Bright Normal 
Superior 121.6 


Total Group ; 109.4 
Standard Deviation ‘ 14.1 


S$ $88 && 





OccUPATIONAL CLASSIFICATION 
Occupation Occupation 


Food processing Professional 
Secretarial Maintenance 
Accounting 
Managerial Dietetics 
Laboratory, technical Miscellaneous 





A five minute sample of speech was elicited from each subject using the following 
standardized instructions from previous studies“: *): ‘“‘This is a study of speaking 
and conversational habits. Upon a signal from me, I would like you to start telling 
me about any interesting or dramatic life experiences you have had. Once you have 
started, I shall be here listening to you, but would prefer not to reply to any questions 
you may feel like asking me until the five minutes are up. Do you have any questions 


you would like to ask me now before we start? Well, then, you may start.’’ The sbu- 
ject spoke into a microphone connected to a tape-recorder. None of the subjects had 
participated in any of our previous studies, nor did they know beforehand what they 
were expected to do. 


ANALYSIS OF DATA 


Each word in the five minute speech samples was categorized using a two-way system of 
classification in which the word is coded according to its ‘‘grammatical’’ and its “psychologic” 
function. This system is a modification from a previous study“). The grammatical categories 
were adjectives (including articles), adverbs, verbs, substantives (nouns and pronouns), preposi- 
tions, conjunctions and interjections. 

The psychological categories attempted to classify words according to the emotive, cognitive, 
and perceptive processes conveyed or the objects denoted, animate or inanimate, regardless of the 
part of speech. The specific categories included in this system of classification were: words de- 
noting feeling or motivation (e.g., happiness, wish, try, badly); obligation (duty, must, obli- 
gated); perceptual processes (look, itch, sensation); reflective or cognitive processes (belief, 
because, think); activity, particularly involving the neuromuscular system (messenger, talk, 
battle); relationship in time (now, week, soon); place or spatial relationship (towards, Canada, 
here); quantity (quart, many, large); quality or state of being (red, strange, exist.) Also classi- 
fied under this ‘‘psychologic” system of categories were words referring to different objects or 

nts. These were subdivided into several categories: words referring to the self (I, my, mine); 
others (her, they, his); self and others (we, our, us); flora or fauna Sone. rose); things (house 
money); the person to whom the communication is addressed (you); and vague or impersonal 
references (people, one’s). Words not falling into any of the above psychological categories were 
classified as auxiliary words. These words in general specify temporal relationships (auxiliary 
verbs), additional relationships between objects or ideas (prepositions or conjunctions) not in- 
cluded in previous categories, the definite and indefinite articles, and certain adverbs. 

In addition to the above categories, a further subdivision was made so that certain words 
carried a triple code. Thus words conveying emotion or need were further classified according to 
whether the word implied approval or approach motivation (pretty, wish), disapproval or avoid- 
ance motivation (frightful, dislike), or were neutral or non- oe in emotional valence (dram- 
atic, not needed). Similarly action was further coded according to whether it was constructive 
(build, heal, teacher), destructive (kill, hurt) or neutral (run, talkative, doing). Furthermore, 
words implying negation (not, never, improbably, disapprove) were so categorized in addition 
to being po as to grammatical and psychological function. 
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The transcribed samples of were all coded by one technical assistant and tabulated 
on a form permitting totalling words in each grammatical category regardless of psycho- 
logical function and in each psychological category regardless of grammatical usage, as well as 
any combination of the two classification systems (i.e., substantives implying time, verbs of action, 
etc.). The number of words in each category was converted to a percentage score based on the 
total number of coded words in the five minute — 

Much preliminary work was done to improve the reliability of the scoring system. A series 
of verbal — from a previous study were coded jointly by the principal investigators and on 
the basis of this experience, rules were developed for classifying certain types of words that fall 
on the borderline between two or more of our word categories. *Then 150 words from each of ten 
new verbal samples were coded independently by the two investigators and compared to deter- 
mine the amount of agreement achieved. 

he agreement on the classification according to grammatical categories ranged from 
92.7% to 96.7% of the total words classified, with a mean of 94.7%. For psychological categories 
the agreement was slightly lower, ranging from 81.3% to 90.7% with a mean of 87.8%. Complete 
agreement on coding a verbal sample according to both systems was obtained in 85.1% of the 
words. Subsequently, two or three technicians have been trained to code verbal samples until 
they were able to obtain an agreement of from 85% to 90% with the coding of the principal in- 
vestigators. In addition, all samples coded by a technical assistant were checked by one of the 
principal investigators to catch any gross errors or inconsistencies. 


RESULTS AND DiscussION 

Reliability of five minute sample. An attempt was made to obtain some measure 
of the reliability of the various scoring categories in a five minute speech sample. 
Since the verbal samples were divided into one minute intervals in the transcription, 
it was decided to tally separately two sub-sample scores of two minutes each. A 
total of 60 pairs of such consecutive two minute sub-samples were examined for this 
reliability check. Theoretically, a better reliability estimate might have been made 
by tabulating each minute separately, but it was not felt that the gain would justify 
the amount of time and effort necessary to handle the data in this fashion. 

Table 2 presents the means and standard deviations of the percentage of words 
falling in each of the major categories in successive two minute sub-samples, the 
correlation between the two sub-samples, and the inferred reliability of the scoring 
category for a five minute sample. The latter figure was obtained using the Spearman- 


TaBLE 2. CoMPARISON or Successive Two MinvuTE VERBAL Sus-Sampies (N = 60) 








Standard Correlation Estimated 
Word Category Deviation between suc- reliability 
First Second cessive two 5 minute 

2 Min. 2 Min. minutes sample 


Adjectives (A) : ; 3.25 
Adverbs (B) 
Substantives (C) 
Verbs (V) 
Prepositions (L) 
Conjunctions (Z) 
Feeling, motivation 
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*Difference in means for successive two minutes significant at or beyond .05 level 
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Brown formula. The reliabilities range from .38 (for percentage references to time) 
to .85 (for percentage references to self) with a median of .72. It is somewhat sur- 
prising that substantives, which are relatively easy to score, show such a low re- 
liability (.45). 

For most categories the mean scores do not differ significantly from one two- 
minute period to the next. However, there is a significant decrease in the percentage 
of references to time and to the self, and an increase in the percentage of references to 
vague, unspecified concepts, themes, or persons (7) from the first to the second two 


Fig. 1. StanparD Scores ON VERBAL CATEGORIES AccoRDING TO I. Q. LEVELS 
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minutes. These trends probably reflect the fact that an individual telling an incident 
usually starts out by giving the setting and specifying the relationship of various 
other people to the subject. Vague references to previous remarks and vague general- 
izations often occur toward the end of the story. 


Sex differences. The percentage scores in each of the grammatical and psycho- 
logical verbal categories mentioned above were analyzed by a double classification 
analysis of variance using sex and IQ as the independent variables. For each var- 
iable the average scores obtained by individuals in each of the IQ groups according 
to sex are indicated in Table 3, as well as the error variance and the F ratios which 
were significant at the .05 level or beyond. In only two instances was the interaction 


Fic. 2. SranparD Scores FoR VERBAL CATEGORIES DIFFERENTIATING MALES AND FEMALES 
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Tasie 3. MEAN PERCENTAGE Scores FoR Worp CaTgecorigs AccorDING To Sex aNnp IQ LEvsEL, 
Wiratn-Group VARIANCE AND SiGniFicant F Ratios 











Word Category Mean Percentage Score 
IQ Within-group F Ratios 
85-100 101-115 116+ Variance Sex IQ 


16 .88 18 .32 19.82 9.02 9.92*** 
18 .26 19.10 
11.11 11.47 4.99 
10.88 10.80 
3.89 9.96*** 


7.21 
4.96 7.12** 








Adjectives (A 
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between IQ and sex significant. These were for neutral action (Go) and reference to 
activity of all kinds (G) which were significant at the 5% level and 1% level respect- 
ively, indicating that the trend in percentage use of words of action for the three 1Q 
levels is different for the two sexes. 

All of the differences in verbal behavior which were related to sex were found 
among the psychological categories; whereas, most IQ differences occurred among 
grammatical categories. Females used a very significantly higher percentage of 
words implying feeling, emotion or motivation, whether positive, negative or neutral. 
They also made significantly more references to self and used more auxiliary words 
and negations. On the other hand, they used a relatively smaller percentage of 
words implying time, space, or quantity, the difference being primarily due to their 
less frequent use of words referring to place or spatial relations, and they used a 
smaller proportion of words referring to destructive action. Altogether, significant 
sex differences were found in 10 of the 34 variables tested, two of which were combin- 
ation or summary variables and hence overlapped some of the other eight. 

There was a tendency for all sex differences to disappear in the highest IQ 
group, with the exception of the relative frequency of words expressing emotion, but 
in no case was this trend statistically significant. The more intelligent women re- 
sembled the men with regard to their less frequent use of self-references and negation 
but the more intelligent men were similar to the women in using fewer references to 
place or spatial relations and words implying destructive action. Additional subjects 
would be necessary to determine whether these trends are valid and if so what impli- 
cations can be drawn from them. Another subject for further research might be the 
effect of the examiner’s sex on the different speech patterns of males and females; in 
our study al] speech samples were elicited by a male. 


IQ differences. Ten variables yielded significant differences on the basis of IQ, 
five significant at or beyond the .001 level. The proportional use of adjectives (in- 
cluding articles), articles alone, prepositions, auxiliary words and words referring to 
quality or state of being, increase significantly with high IQ; whereas, the proportional 
use of adverbs, verbs, interjections and references to self decrease. Only for the 
category “references to others’’ was the relationship not a monotonic one: for this 
category the middle IQ group had a significantly smaller average percentage than 
either the low or the high IQ group. 

The differences in speech pattern found here relate fairly well with casual ob- 
servation. The normal intelligent person tends to use more complex sentences in 
which ideas are related meaningfully, objects and situations are described more 
elaborately, and his own position as an observer is usually subordinated. Since 
education and intelligence are highly correlated in this sample, it would be impossible 
to say to what extent these differences stem from differences in intelligence per se 
and to what extent they are the result of formal education. Certainly environmental 
factors play a strong role in language development, but the role of formal education 
in molding the speech patterns analyzed here cannot be clearly determined. 

The intercorrelations among several of the variables found to be related to IQ 
were obtained and are presented in Table 4. From these values a multiple regression 


Tasie 4. INTERCORRELATIONS AMONG VARIOUS VERBAL CATEGORIES AND THEIR 
RRELATIONS WITH I 
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equation was determined. Conjunctions were included as a possible suppressor var- 
iable since they correlated significantly with adjectives, adverbs and prepositions 
but not with IQ. That conjunctions do act asa suppressor variable is indicated by 
their high 8 weight in the multiple. The seven variables yield a multiple correlation 
of .651 with IQ. When auxiliary words (W) are dropped, the multiple correlation be- 
comes .648, a negligible decrease. The final equations obtained for this sample are: 
Est. 1Q = 24A+08B4+1.7V +30L 42.72 —151—-31°4 foneiys) 
Since a stratified rather than a random IQ sample was used and also since the equiva- 
lent IQ scores from the Wonderlic test are in themselves estimates, it is doubtful 
that the above equation would result in a very accurate estimate of IQ for another 
sample and a different criterion of intelligence. Furthermore, a linear equation 
gives a relatively poor fit to our data since larger differences occur between the low 
and middle IQ groups than between the middle and high IQ groups. This non- 
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linear relationship might be expected since the majority of the variables involved are 
grammatical ones and these have limits to their frequency of use because of the very 
nature of our language-structure and its function as a means of communication. As 
an illustration, the use of adjectives and prepositions to the exclusion of verbs would 
result in a high score of estimated IQ but would fail to be meaningful communication 
and would be likely to require a different interpretation than as a manifestation of 
superior intelligence. However, it is possible that our estimated IQ score within 
limits might be a useful predictor, particularly where the ability to communicate 
effectively in stressful situations is important. Also, variations in this score from 
time to time for a single individual may have interesting psychodynamic correlates. 

From the standpoint of further research into the personality factors which are 
revealed in speech, it is important to take cognizance of the differences in word 
usage associated with gender and intelligence. Studies of speech patterns must take 
these factors into account, either by using a balanced sampling design or by using 
appropriate standardized scores. In order to facilitate such usage we have standard- 
ized the percentage scores obtained on this group (Fig. 1, 2,3) using separate norms 
for males and females wherever significant differences have occurred. The converted 
scores all have a mean of 50 and a standard deviation of 10. Separate standard 
scores are also given for those category scores which vary monotonically with in- 
telligence. Because of the high correlation between education and IQ for adults 
found in this study (.72) and others (median .65) “ it is suggested that when intelli- 
gence scores are not available, the low IQ norms be used for those individuals with 
less than a 12th grade education, the middle IQ norms for those with 12 to 15 years 
education and the high IQ norms for those with a college degree. 

We have mentioned that our sample was based on individuals who showed no 
evidence of disabling emotional and physical illness. It should be recognized, how- 
ever, that this group was not psychiatrically screened by techniques designed to 
uncover possible emotional or psychophysiologic disorders that might be present, 
but not severe enough to be disabling at the time. Thus, the group cannot be con- 
sidered homogeneous from the standpoint of optimal personality adjustment. How- 
ever, we believe that our norms can be used to compare the speech patterns of this 
group with those of variously physically or emotionally disabled groups of people, 
although we realize that a more homogeneous control group might make it possible 
to discern smaller differences in verbal behavior associated with various types of 
medical and personality illnesses. 


SUMMARY 


The relationship of sex and intelligence (as measured by the Wonderlic test) to 
word-type usage was explored in five-minute speech samples of a group of 90 occupa- 
tionally adjusted, medically healthy individuals. The speech samples were elicited 
by standardized instructions given by a male investigator, and the word-types were 
analyzed and scored according to two systems of categories, a grammatical and a 
“psychologic”’ system. Reliability of scoring the verbal samples has been developed 
to the point where it is possible to obtain 85-90% agreement between two scorers. 
The reliability of ordering individuals by each of the various scoring categories using 
a five-minute speech sample was checked and the median reliability found to be .72. 
Thus a five-minute sample apparently gives an adequately representative sample of 
the verbal behavior of an individual with regard to most of the word categories we 
have used. 

Under these experimental conditions, significant sex differences were found 
among certain of the “psychologic’’ word categories. Also, significant differences 
were found to be associated, step-wise, with the level of intelligence, and these 
differences occurred principally among the grammatical categories. The differences 
between the sexes in word-type usage tended to decrease at the highest level of 
intelligence. 
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Separate word-frequency norms for males and females and for those word 
categories which vary with intelligence are provided in tabular form. The norms 
presented here can be used to compare our sample with groups of individuals with 
various types of medical and personality illnesses. 
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PERFORMANCE AND VERBAL IQ IN A GROUP OF 
SOCIOPATHS 


ARTHUR N. WIENS JOSEPH D. MATARAZZO AND KENNETH D, GAVER 
Oregon State Hospital University of Oregon Medical School 


PROBLEM 

Following Wechsler’s“*) manual and other studies®: '°, many clinicians assume 
that the intelligence test results of sociopaths will, among other things, be character- 
ized by a Performance IQ that exceeds the Verbal IQ. Commenting on this assump- 
tion, Schafer writes: ‘“‘The generalizations to be advanced about them are not based 
on formal statistical investigations; they are considered to be valid because we have 
had a high degree of success in applying them in our every day work’”’“® P.™, A 
review of the research literature reveals that (a) few studies support this assumption, 
and (b) those studies that are reported use such divergent definitions of sociopathy 
or psychopathy that Rabin and Guertin“? were led to suggest that the inconsistency 
in the use and meaning of this diagnostic label meant that different investigators 
were most likely studying very different kinds of subjects. 

In the reviews of research with the Wechsler-Bellevue by Rabin“), Watson 
(3), Rabin and Guertin“, and Guertin, Frank and Rabin“? reference is made to 
five studies with adults that bear directly on the present investigation. Clark“ 
found a lower verbal score in “‘institutionalized behavior problem groups’”’ of civilians 
and American and Mexican soldiers, and thus substantiated the clinical assumption 
of a higher performance IQ in this group of offenders. However, Strother“ in a 
group of ‘“‘psychopaths,” and Thurston and Claden“ in a group of “tuberculosis 
patients who left against medical advice’’, did not find the higher performance IQ. 
Similar negative findings were reported by Clark and Moore in their differentiation 
of “military offenders” into groups of ‘‘no NP disorder and immaturity reaction’’ 
versus “psychopaths’’®), and by Gurvitz who attempted to differentiate ‘‘psycho- 
pathic” and “non-psychopathic”’ inmates in a prison“. Thus, of the five studies, 
one reported a performance IQ higher than verbal IQ, and four studies did not. 

This study investigates whether sociopathic patients earn a significantly higher 
performance IQ than verbal IQ on the Wechsler test of adult intelligence. The opera- 
tional definition of sociopath used was a behavioral one: the experimental subjects 
were a group of 112 sex offenders (all of whom, at the time of examination, already 
had been convicted in a court of sexual crime against another person) sent to the 
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Oregon State Hospital for psychiatric and psychological examination.! The criter- 
ion for inclusion in this study thus emphasized “‘overt”’ rather than “‘latent’’ behavior. 

A second observation prompted this present study; namely conflicting data 
regarding the intellectual level of sex offenders. Karpman‘ and Ellis and Bran- 
cale®) review approximately equal numbers of studies suggesting lower than average 
— average intelligence, and above average intelligence for groups of sex 
offenders. 


PROCEDURE 

From 1955 to 1957, 142 convicted male sex offenders were admitted to the 
Oregon State Hospital for study. Of this group, 112 patients were referred and stud- 
ied by means of psychological tests, including the Wechsler-Bellevue, Form I in 81 
cases, the Wechsler Adult Intelligence Scale in 29 cases, and the Wechsler Intelli- 
gence Scale for Children in two cases. 

Our subjects were these 112 males ranging in age from 14 years to 64 years, 
mean 32.4 years. Sixty per cent were between the ages of 20-39. Of this group of 
112, 50 men were married, 42 were single, 16 were divorced, three were separated 
from their wives and one man was a widower. The offenses for which they were con- 
victed were: contributing to the delinquency of a minor (29 Ss), sodomy (21 Ss), 
rape (218s), statutory rape (12 Ss), indecent exposure (5 Ss), assault with intent to 
rape (4 Ss), attempted rape (4 Ss), assault with a deadly or dangerous weapon (3 Ss), 
delinquency (3 Ss), and homosexual activities (2 Ss). Eight additional cases were 
accounted for by a variety of sexual offenses. In 90 cases the offense was listed as 
being against a child, and in 15 cases against an adult; the preponderant number of 
cases (72) involved a female child. 

The clinical psychiatric diagnosis given these men after intensive study, in 
contrast to the more rigorous behavioral classification of sociopath given to all 112, 
was sociopathic personality disturbance (72 Ss), mental deficiency (8 Ss), schizoph- 
renia (9 Ss), psychoneurosis (2 Ss), inadequate personality (4 Ss), adjustment re- 
action of adolescence (3 Ss), adult situational reaction (1 8), depressive reaction 
(1 8), schizoid personality (2 Ss), chronic brain syndrome (3 Ss), and 7 Ss without 
mental disorder. These clinical diagnoses are presented for informational purposes 
only, since they involve the well-known lack of reliability for clinical diagnoses. In 
educational background the 112 subjects ranged from no formal education to one 
year of graduate study with a mean education of ninth grade; 19 per cent had not 
completed the eighth grade and 23 per cent had more than three years of high school. 
With the exception of one man who stayed in the hospital beyond the duration period 
of this study, the range of hospitalization was from one to 882 days with a mean 
hospitalization period of 51 days. 


RESULTS AND DIscUSSION 
The range of Full Scale 1Q’s for this group of 112 subjects was from 64 to 135, 
with a mean [Q of 100.1 and a standard deviation of 15.3, these values being almost 
identical with Wechsler’s“*) standardization sample. Therefore, our sample of 112 
subjects in no way differs from a random sample of males from the general popula- 
tion with respect to general intelligence. A breakdown of the Full Scale IQ’s of the 
various sexual offense sub-groups (listed earlier) revealed average Full Scale IQ’s for 
each sub-group. Thus, a second conclusion from this initial finding is that specific 

sex offense seems to be independent of intellectual level. 
The results shown in Table 1 allow us to answer the main question of this study. 
It is clear that, as a group, the 112 sociopaths earned a statistically significant higher 
(.001 level of confidence) performance IQ than verbal IQ. Thus when the criterion 
of sociopathy is clearly defined by overt behavior the assumption that sociopaths 


1Since 1953, Oregon law has required the psychiatric examination of certain persons convicted of 
sex crimes. These limits are quite general and inclusive and pertain to all offenses involving a child 
under 16 years of age. 
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TaBLE 1. CoMPARISON OF PERFORMANCE AND VERBAL IQ For GroUP OF 
112 SoctopaTHs 


Performance Verbal Scale 
Scale IQ IQ 


Mean 104.0 97 .6 
8. D. 15.0 17.3 
Range 62-137 69-130 











have a higher performance than verbal IQ is established. While the mean difference 
(6.4 IQ points) is not large, it is clearly large enough to attain an unusually high 
degree of statistical significance, and thus can be presumed to be a reliable finding. 

Of the 112 subjects in our sociopathic group, 81 had a higher performance IQ 
than verbal IQ and 31 had lower performance IQ. This difference was significant at 
the .001 level and is a clear indication that, whether dealing with group or individual 
data, our group of sociopaths do, in fact, earn a higher performance than verbal IQ. 
It would thus appear that as suggested by Rabin and Guertin “, other investigators 
may have been studying different kinds of subjects than those that comprised our 
sample, and that, if the definitional problem of what is a sociopath could have been 
overcome, they might have found similarly positive results. 

These findings are duplicated in entirety in the subgroup of 72 subjects out of 
the 112 given the clinical psychiatric diagnosis of “sociopathic personality disturb- 
ance’. This sub-group obtained the following mean scores: Full scale 1.Q. 102.4, 
Verbal scale I.Q. 99.4, and Performance scale 1.Q. 105.5. The group difference of 6.1 
1.Q. points between performance and verbal scales is statistically significant at the 
.001 level (¢ = 5.11). In addition, taking the 72 subjects as individuals, 53 (74%) had 
a higher performance scale I.Q., while only 19 had a higher verbal scale I.Q. This 
difference also is significant at the .001 level (Chi-square 16.1). 


SUMMARY 


This study investigated whether or not a group of 112 sociopaths, defined in 
terms of overt behavior (court conviction for a sexual crime), would earn a higher 
performance than verbal IQ. Both for the group as a whole, and for the subjects 
taken as individuals, the findings were clear cut that the performance IQ is higher 
than the verbal IQ. 
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THE INFLUENCE OF AGE ON MMPI RESPONSES 
GEORGE CALDEN AND JACK E. HOKANSON 
Veterans Administration Hospital, Madison, Wisconsin 


PROBLEM 

Manuals of intelligence tests generally list intelligence norms for various age 
levels and describe intellectual functioning as inextricably related to the factor of 
age. This consideration, however, has not as yet permeated the field of objective 
personality testing despite the importance of personality changes which frequently 
accompany the aging process. The Revised, 1951 Manual of the MMPI®», for 
example, makes no reference to the influence of age upon test responses other than a 
statement suggesting that the inventory “is appropriate for administration to any 
cooperative subject from about 16 years of age upward.” Aaronson’s®) study is 
among the few which have been concerned with age as a variable in the interpretation 
of MMPI test records. A limitation of this study, however, is that it dealt only with 
single MMPI profile peaks rather than an analysis of all the standard scale and 
validity scale scores. Also, his population, taken from the MMPI Atlas“, consist- 
ed predominantly of in-patients of a psychiatric hospital. The present study, on the 
other hand, investigates the age factor on MMPI scores among relatively normal 
individuals in a non-psychiatric hospital setting. 


PROCEDURE 
The subjects consisted of approximately 90% of male tuberculosis patients who 
were admitted consecutively to the VA Hospital, Madison, Wisconsin, during a 6 
month period. The individual form of the MMPI was administered to them during 
their first month of hospitalization as part of an extensive study of the relationship 


of personality and body type to tuberculosis. Records with validity scale scores 
above a critical score of T = 70 were rejected except on the F scale where records 
having scores above T = 80 were disqualified. This procedure provided a total 
sample of 160 subjects whose average age was 42. 8 years. This sample was divided 
into five groups by decades. Analyses of variance between groups were obtained for 
each of the standard and validity scales as well as for the Taylor Anxiety Scale. 


RESULTS AND DIscuUssSON 


The F test values for each of the MMPI scales are presented in Table 1. A 
perusal of the table indicates that the Hs, D and Si scales show a significant rise as 


TasBLE 1. Comparison oF MMPI Scorss or Successtve Decapses (N = 160) 
Age Range 
20-29 30-39 40-49 50-59 60-69 
(N=44) (N=46) (N=25) (N=30) (N= 
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the subjects’ ages increase. None of the other scale scores, including the validity and 
anxiety scales show any appreciable relationship to the age factor. The significant 
changes on Hs, D and Si suggest that, in contrast to younger patients, people become 
more preoccupied with themselves both physically and mentally as they become 
older. These self-centered preoccupations seem to correspond to the decline in 
physical prowess and health, the loss in personal competence and mental capacity 
reported in studies of old age. Busse®?, for example, in his study of a large group of 
older people found depressive states, hypochondriacal reactions and tendencies to 
feel isolated from other people (wandering) to be prominent characteristics of his 
subjects. Interests are shifted away from external events and are focused inwardly, 
particularly on the body and its functioning. 

As in Aaronson’s results, the Hs and D scores tended to be elevated among the 
older groups (Si scale scores were not used in his study). However, in contrast to his 
findings, which were based on the MMPI records of severely maladjusted individ- 
uals, the Pd and Sc scores in the present study showed no significant change with age. 
In view of the large number of psychopathic and schizophrenic personalities in his 
sample, it is more likely that the profile peaks reflecting these disturbances would 
be prominent in his results. Apparently the anti-social and criminal behavior char- 
acteristic of those obtaining high Pd scores and the dissociative behavior of those 
receiving high Se scores are to be found more readily in a youthful mental hospital 
sample. These forms of behavior resulting in high Pd and Se scores, however, are 
less likely to be found in a more normal hospitalized population of a comparable age 
category. 

In view of the significant elevations in Hs, D and Si at the older age levels, a 
careful consideration of the individual’s age may be necessary for an adequate eval- 
uation of his MMPI record. For example, in this study, the mean Hs score for sub- 
jects in their twenties was 58.29, whereas the mean for subjects in their fifties was 
71.57 - a difference of 13 points. This difference suggests the need for the develop- 
ment of appropriate Hs and other scale norms for each of the age levels. 


SUMMARY 


The aim of this exploratory study was to investigate the age factor in MMPI 
records of a relatively normal hospital male population. Significant increases in Hs, 
D and Si were observed, reflecting the increased hypochondriacal, depressive and 
introversive tendencies with advancing age. The necessity for establishing MMPI 
scale age norms is suggested. 


REFERENCES 


1. Aaronson, B.S. Age and sex influences on MMPI profile peak in an abnormal population. J. 
consult. Psychol., 1958, 22, 203-206. 

2. Bussg, E. W. Treatment of the nonhospitalized emotionally disturbed elderly person. Geria- 
trics, 1956, 11, 173-179. 

3. Haraaway, 8. R. and McKmtey, J. C. Minnesota Multiphasic Inventory Manual (rev. ed.). 
New York: Psychological Corp., 1951. 

4. Hatnaway, 8. R. and Meest, P. E. An Atlas for the clinical use of the MMPI. Minneapolis: 
Univ. of Minnesota Press, 1951. 





THE RESPONSE OF PRISON INMATES TO SEVEN 
NEW MMPI SCALES 


JAMES H. PANTON 
Reception Center, Central Prison 
Raleigh, N. C. 


? 


PROBLEM 

In recent years there have been a number of new MMPI scales designed to 
measure the personality characteristics, beliefs and attitudes of persons classified 
with respect to psychosocial determinates of behavior. Seven of these scales appear 
adaptable to the measure of various aspects of personality dysfunction generally char- 
acteristic of adult delinquency. The purpose of this study is to test ‘these seven 
MMPI scales on a state prison population to determine if prison inmates respond 
significantly in the direction of asocialized patterns when compared to a sample 
drawn from the normal adult population. 


PROCEDURE 

The MMPI records of 2,374 male admissions to the North Carolina State Prison 
during 1957-1958 were scored on the following new MMPI scales: Prejudice (Pr) “?, 
Social Status (St)®’, Responsibility (Re)“’, Dominance (Do), Role Playing 
(RP) “, Dependency (Dy) “?, and Ego Strength (ES) “. The means and standard 
deviations for each of the scales were computed and compared to normative data 
recently compiled by Hathaway and Briggs®’. The t-test was employed to deter- 
mine which of the scales produced significant mean differences when comparing the 
prison and normal samples. 


RESULTS AND Discussion 


Table 1 presents the comparison of means and standard deviations. Simple 
inspection of the table reveals that the prison sample distinguishes itself from the 
normals in the direction of asocial patterns on all but the Social Status and Role 
Playing scales. The mean differences appearing between the prison and normal 
samples on these two scales are less than unity and are not statistically significant. 
It is interesting to note that the prison mean score on the Role Playing scale is of 
greater positive value than the corresponding mean score of the normal sample. 
These findings do not lend support to Gough’s® role playing theory of psycho- 
pathy. On the other hand the prison sample’s favorable showing on the Role Playing 
scale could be indicative of less indices of psychopathy among prisoners than is 
generally believed to be the case. 

The elevated Prejudice score of the prison sample presents a picture of feelings 
of resentment and distrust, apprehension over daily association with others, lack of 


TaBLe 1. CoMPaARISON OF MEANS AND STANDARD DEVIATIONS 





Scales Normal Sample Prison ple 
N = 226 N = 2374 
Mean SD Mean SD Mean Diff. ¢-Ratio 





11.81 . 14.67 . 2.86 7.73* 
18.03 ; 18.59 : -56 1.87 
20.21 ‘ 18.04 . -2.17 7.48* 
14.76 , 13.39 ‘ -1.37 5.48* 
18.52 5 18.95 ‘ 43 1.65 
44 .33 . 39.65 , 4.68 10 .88* 
19.41 F 23.76 ‘ 4.35 7 .02* 





*Significant at .01 level of confidence 
-Indicates difference of lesser degree from the standard normative mean. 
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self-regard, and a pessimistic attitude toward the future. Gough) describes this 
syndrome as paranoid in its intensity, but lacking in excessive situational and self- 
deluding aspects of true paranoia. 

The low mean scores of the prison sample on the Responsibility and Dominance 
scales appear to indicate the lack of ability to conscientiously assume social responsi- 
bility, a lack of dependability, an unhealthy submissiveness in face-to-face contacts, 
and a susceptibility toward being easily influenced and intimidated by others. The 
high Dependency score of the prison sample implies a strong dependency on others, 
feelings of insecurity and inadequacy, and a lack of confidence in one’s ability to 
cope with the socio-economic demands of society. 

The Ego Strength scale, which was developed as a clinical indicator for predict- 
ing the responses of psychoneurotic patients to psychotherapy, has been reported 
by Barron“? as also useful in assessing effective personal functioning where an esti- 
mate of personal adaptability and resourcefulness is desired. The apparent degree of 
ego-dysfunction among the prison sample is reflected not only in the low Ego Strength 
score but also in the deviate scores on the Prejudice, Responsibility, Dominance, and 
Dependency scales, all of which measure to some degree the various aspects of 
effective functioning which are synonymous with the term ego strength. 


SUMMARY 


An investigation of the MMPI responses of 2,374 prison inmates on seven new 
MMPI scales designed to measure beliefs and attitudes reflecting the degree of ade- 
quate personal-interpersonal functioning and social adaptability, revealed that 
prison inmates score in the direction of inadequacy on those scales designed to meas- 
ure prejudice, responsibility, dominance, dependency, and ego strength. The mean 
scores of the prison sample on measures of social status and role playing ability did 
not differ significantly from the mean scores of normals. 
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A STUDY OF PERFORMANCE DIFFERENCES ON THE CATTELL 
INFANT INTELLIGENCE SCALE BETWEEN MATCHED GROUPS OF 
ORGANIC AND MONGOLOID SUBJECTS 


ANNE MARIE KRALOVICH* 
North Jersey Training School, Totowa, New Jersey 


PROBLEM AND METHOD 

The present study investigates inherent differences in test performance between 
a group of subjects medically diagnosed as brain damaged and a group of mongo- 
loids. In order to rule out differences which might result from physical incapacities, 
only those organic subjects who were not impaired in manipulatory movements 
were included in the present investigation. 

Twenty-eight subjects ranging from five to eight years (M = 6.5 yrs.) in chron- 
ological age were selected to comprise the group of subjects diagnosed by medical 
findings as organics. They were individually matched with a group of twenty-eight 
mongoloid subjects for age, sex and socio-economic status. Each group consisted of 
twelve males and sixteen females. The length of residence in the present institution 
ranged from one year to five years (M = 3.5 yrs.) for the organic group and one year 
to seven years (M = 4.5 yrs.) for the mongoloids. 

The study was conducted in the nursery unit of the North Jersey Training 
School at Totowa. This unit (the only one of its kind in the State) has a population 
of two hundred seventy-five children who are accepted for admission from infancy 
through the age of five when they are then transferred to various other state institu- 
tions for the mentally deficient. However, the lack of vacancies in other state 
institutions for the mentally deficient may prolong residence within the Nursery 
unit beyond the age limit of five years. Both male and female subjects are accepted 
for admission within this unit and physical impairment may range from mildness to 
severity in degree. 

The intelligence test uti'ized in the present study was the Cattell Infant Intelli- 
gence Scale which ranges from the 2nd month of mental development through Year 
IV. At Year III, the Scale links with items on the Stanford Binet, Form L. There 
are a total of 113 subtests in the entire Scale. From the 2nd through the 7th month, 
motor items are the main emphasis. At the 8th month, verbal concepts are intro- 
duced. However, motor reproduction continues to predominate until the 23rd and 
24th month when verbal factors become the main emphasis. The Scale was in- 
dividually administered under standard conditions in a test setting especially de- 
vised for such purposes. 


RESULTS 
Table 1 presents the mean MA, IQ and SD for the organic and mongoloid sub+ 
jects tested. The mean MA for the organic subjects is 1.2 years and for the mongo- 
loids it is 1.4 years. The difference of .2 years in MA is not statistically significant. 
The mean IQ for the organic group is 15.0 and SD 9.94. For the mongoloid group 
the mean IQ is 17.2 and SD 7.24. The mean difference in IQ and SD between both 
groups is not, however, statistically significant. 


TaBLe 1. Mean MA, IQ anv SD For THE ORGANIC AND MONGOLOID SUBJECTS 





Organics Mongoloids Difference 





Mean MA 1.2 years 1.4 years .2 years 
Mean IQ 15.0 17.2 2.2 
SD 9.94 7.24 2.7 





Seer is extended to the Research Staff at the Johnstone Training School, Borden- 


own, N. J., for their able assistance in the statistical design of this study. 
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Since no statistically significant differences were obtained between both groups 
in MA and IQ, it was decided to investigate differences in subtest performance. 
The Sign Test! was utilized in comparing performances on the 113 subtests of the 
Scale. From Table 2, it is apparent that 5 of the 113 subtests show significant 
differences between the groups beyond the .05 level of confidence and another 6 
subtests closely approach the .05 level. Those items which show highly significant 


TasLe 2. Svusrests Wuicu SHow SIGniFIcANT DIFFERENCES BETWEEN 
MoNGoLoIp AND OrGaANIC SUBJECTS 





Month Subtest Description of Subtest Level of Significance 





Transfers object from hand to hand 
Cube, secures 

Pellet, attempts 

Paper, exploits 

Ring, pulls by string 

String, manipulates 

Toy, uncovers 

Spoon-rattle, hits outside 

Square box, covers 


10 
10 
21-22 


ee DO Ot et 





“Significant beyond .05 level 


differences between the organics and mongoloids comprise motor manipulations 
which require simultaneous adductive and abductive hand movements. It is evident 
that the organic group has not been able to achieve this motor development and 
exploratory interest to the extent of the mongoloid group. The subtests which ap- 
proach near significance at the .05 level of confidence suggest furthermore that the 
organic group are not as facile as the mongoloids with motor items requiring grasping 
and manipulation of objects. 

In evaluating these results, it may be mentioned that the significant differences 
in test performance were elicited in the first twenty-two months of mental develop- 
ment on the Scale. This would include a total of 80 subtests. Beyond this level of 
mental development on the Scale, the number of subjects succeeding was too scant 
to secure an adequate sampling or comparison between the two groups. 


CONCLUSIONS 


The present study elicited significant differences in motor performances between 
matched groups of organic and mongoloid subjects. These differences are most 
highly significant between the mental age development of 5 to 10 months. Although 
the present study includes only 28 subjects in each group, it illustrates certain 
differences which might be investigated in a study of larger groups and suggests the 
possibility of devising a comprehensive motor performance scale which might dis- 
criminate organic children, under one year of age, who present problems of differ- 
ential diagnosis. 


ee ae Stoney. Non-Paremetric Statistics for Behavioral Sciences. New York: McGraw-Hill, 
, 68-71, 





SOME EFFECTS OF SOCIO-ECONOMIC, AGE AND SEX FACTORS ON 
CHILDREN’S RESPONSES TO THE ROSENZWEIG 
PICTURE SR USERATION STUDY 


ROBERT E. STOLTZ AND MARSHALL D. SMITH 
Southern Methodist University Tcxas Education Agency 


Several investigators have pointed out a number of factors which appear to 
influence the frequency with which certain types of responses will be made to the 
Rosenzweig Picture-Frustration Study (RPFS). Spache“? discovered that responses 
made to situations involving an adult and a child tend to differ from those made to 
situations involving only children. McCary®?, using subjects ranging in age from 
14 to 22 years of age, found geographic and racial as well as sex differences in res- 
ponse frequency. This present study attempts to determine the effect of age, sex 
and socio-economic factors on response patterns to the RPFS when the pictures are 
separated into adult-child (A-C) and child-child (C-C) situations. The specific 
hypotheses to be tested were: (a) there will be no significant differences between the 
responses of children of upper and lower socio-economic levels, (b) there will be no 
significant differences between male and female responses, (c) there will be no signi- 
ficant age level differences in response frequency, and (d) children will not respond 
significantly different in frustrating situations involving adults than in frustrating 
situations involving only children. 


PROCEDURE 

The Ss used in this study were 167 elementary public school students. Eighty- 
one of the Ss, 39 male and 42 female, were currently attending a school described by 
public school administrators as serving an upper-middle class community. Eighty- 
six of the Ss, 41 males and 45 females, were currently attending a school described 
as serving a lower class area. The Ss were grouped into three age levels: 8-9 years, 
10-11 years, and 12-13 years of age. The N for each of the twelve subgroups, based 
= he possible combinations of age level, sex and socio-economic level, are given in 

able 2. 

The Children’s Form of the RPFS was administered to the Ss in classroom 
situations. The same administrator was used for testing all of the Ss. Printed in- 
structions given by Rosenzweig were followed and in addition the examiner read all 
of the remarks of the persons shown in the test situations, encouraging the children 
to follow in their own test booklets. Tests were scored independently by two judges 
according to the instructions given by Rosenzweig®. In cases of disagreement be- 
tween the judges, the original responses were discussed until agreement was reached. 
Only those scoring categories dealing with direction of aggression and reaction type 
were considered in this study. 

In order to investigate the hypotheses it was necessary to divide the situations 
depicted in the RPFS into two categories: those involving adults and children 
(A-C) and those involving only children (C-C). Spache, in dividing the pictures on 
the RPFS, placed thirteen situations in the A-C category and eleven in the C-C 
category. However inspection of the pictures themselves and following Rosenz- 
weig’s verbal description of each situation, would indicate that it is more reasonable 
to place fourteen of the situations in the A-C category and ten in the C-C category. 
This latter method of division was followed in this study. The situations designated 
as C-C were the following: 2, 3, 8, 9, 12, 16, 18, 20, and 21. The remaining situations 
were classed as A-C situations. 

RESULTS 

Since scores for the RPFS dimensions are obtained in terms of frequencies, and 
the distributions of these frequencies are markedly skewed in many cases, and since 
the number of Ss in many of the comparison groups is small, non-parametric methods 
of data analysis were used throughout. Hypotheses a, b, and c were examined by 
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applying the Kruskal-Wallis one-way analysis of variance by ranks test to the data. 
The hypotheses were consideved separately for each scoring dimension and for each 
type of situation. The results of this phase of the study are shown in Table 1. 


TasLe 1. Vatues or H ror DIFFERENCES BETWEEN SCORES OF Major CoMPARISON GROUPS 











| Scoring Category 


Comparison Group | df I’ i M 


Adult-Child Situa- 
tions 
Upper vs. Lower ; 7.64** | 1.27 : ; : 8.65** 
Ms e vs. Female é 47 .00 ‘ B ‘ .63 
Age Levels : 5.53 9.36** | . ‘ : 4.58 


Child-Child Situa- 
tions 
Upper vs. Lower | 1 ‘ 19.62** | 9.29°*| . ‘ - d 4.54* 
Male vs. Female | 1 7 2.36 31 - , : .72 
Age Levels 2 77 4.82 .08 é ; 4.31 






































*Significant at .05 level. 
**Significant at .01 level. 


Hypothesis d was tested by applying the Wilcoxon Matched Pairs Signed Ranks 
test to the data to determine whether or not significant differences between responses 
to the A-C and the C-C situations existed for each subgroup. Because of the unequal 
number of situations in the two situation categories frequencies of response were con- 
verted to percentages before applying the Wilcoxon test. Table 2 summarizes the 
results of this phase of the data analysis." 


TaBLE 2. LEVELS oF CONFIDENCE OF DIFFERENCES BETWEEN Scores In ADULT-CHILD 
AND CHILD-CHILD SITUATIONS 








Scoring Categories 





| 


Grou l 
, e|e|r I i 





Age 8-9 
Female Lower ‘ 4 .01* | .02* 
Female Upper 2 ‘ -05¢ | .01* 
Male Lower 4 ‘ .01* 
Male Upper ‘ , .05* | .O1* 


Age 10-11 
Female Lower j ‘ .02* 
Female Upper j F ‘ d .01* 
Male Lower ; ‘ .O1* 
Male Upper : 7 .01* 


Age 12-13 
Female Lower Soe .01* 
Female Upper ~~ ‘ .O1* Olt 
Male Lower | . .05* | .01* 
Male Upper .05* | . .05* | .01* Olt 



































Note.—The labels “upper” and “lower” under the Group column indicate upper and lower socio- 
economic group respectively. The dash indicates that the comparison was not significant at the .05 
level or beyond. 

*Indicates more frequent responses in adult-child situations. 

{Indicates more frequent responses in child-child situations 


1To conserve space only the levels of confidence of the obtained values of T are presented in 
Table 2. Tables presenting prime data for this and other comparisons reported in this article have 
been deposited with the American Documentation Institute. Order Document No. 5749, remitting 
$1.25 for 35-mm. microfilm or $1.25 for 6 by 8 in. photocopies. 
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Discussion 


Hypothesis a was not confirmed for scoring categories E, I, and M for the A-C 
situations, or for scoring categories E, e, M, and m in the C-C situations. When the 
frustrating situation represents an adult-child relationship it appears that lower 
class children tend to project blame and hostility against the frustrator or some thing 
in the environment more frequently than do upper class children. Interestingly, the 
lower class children also appear to evade blame for the frustration more frequently, 
and tend to regard the situation as unavoidable. The upper class children, in response 
to the same A-C situations, tend to direct the blame and censure upon themselves 
more frequently than do the lower class children. Somewhat different effects are 
observed when the situations involve only children. While the lower class children 
still tend to project blame and hostility against the frustrator more frequently in 
C-C situations, the upper class children now tend to evade blame for the frustration 
or absolve the frustrator. This latter tendency is the reverse of what was found to be 
true for the stimulus figures involving adults and children. The upper class children 
also show more frequent mention of solutions expected from someone else and ex- 
pressions that time or normally expected circumstances will bring about a solution 
to the problem. It would also appear that patience and conformity are more char- 
acteristic of the upper class children in the situations involving only children and 
not in those situations involving adults and children. 

Hypothesis b was confirmed for all of the scoring categories for both types of 
stimulus situations. In other words, no significant sex differences in response fre- 
quencies were found. This would indicate that so far as verbalized responses to 
frustrating situations are concerned there are no consistent response patterns that 
could be traced to sex factors. McCary has reported sex differences, however it 
should be noted that he used an older group of Ss than was used in this study. It 
may well be that distinct responses to frustrating situations related to sex role identi- 
fication do not appear until after the ages of 12 to 13 years. 

Hypothesis c was not confirmed for scoring categories e and i for the A-C situa- 
tions and for categories I and i for the C-C situations. This may be interpreted as 
indicating that as the child increases in age, regardless of sex or socio-economic fact- 
ors, the child tends to increase the frequency of mention of responses indicating that 
a solution for the frustration is expected of someone else in A-C situations. Similarly, 
as the child matures it tends to offer more frequent amends in an attempt to solve 
the problem in both A-C and C-C situations. In addition as he matures the fre- 
quency of responses in which the blame and censure are directed upon himself tend 
to decrease. This latter response pattern would perhaps parallel the child’s gradual 
process of identification with a more adult role. 

As an inspection of Table 2 will reveal, Hypothesis d was confirmed only in part. 
This hypothesis suggested that children would not respond significantly different in 
frustrating situations involving adults than in frustrating situations involving only 
children. This hypothesis must be rejected for the scoring dimensions E’, E, I, and 
M. While a consistent pattern is difficult to determine, it would appear that certain 
subgroups differ sufficiently on dimensions I’ and m to suggest real differences. The 
evidence for rejection of the hypothesis completely for dimensions e, i, and M’ would 
not appear to be sufficient. Although there are cases of differences in response in the 
latter three groups the consistency of response is not sufficient to be able to isolate 
any general tendency of response patterning. The results indicated in Table 2 may 
be interpreted as indicating a general tendency for children to show a greater tend- 
ency to point out the presence of the frustrating obstacle in the A-C situations, to 
turn blame and hostility against some person or thing in the C-C situations, and to 
direct blame and censure upon himself in the A-C situations. There is also a fairly 
general tendency for the children to evade blame for the frustration altogether, that 
is regard the situation as unavoidable, more frequently in the C-C situations. The 
less well defined tendencies suggest that the type of response in which the frustrating 
obstacle is construed as not frustrating or in some way beneficial is more commonly 
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produced in the A-C situation and expressions of hope that time or normally expected 
circumstances will bring about a solution are more frequently produced in A-C 
situations. 


SUMMARY 


The Children’s Form of the Rosenzweig Picture-Frustration Study was admin- 
istered to 167 elementary school students. The Ss were grouped into 12 subgroups 
representing the possible combinations of three age levels, sex, and upper or lower 
socio-economic level. The 24 situations depicted in the test were divided into adult- 
child and child-child situations according to the stimulus elements in each of the 24 
situations. The effects of sex, age level, and socio-economic level on the responses 
produced in each of the two types of situations was determined. Significant effects 
of age level, socio-economic level and situation components were found. No signifi- 
cant sex effects were obtained. 
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PERCEPTION OF SELF AND OF NORMALITY IN SCHIZOPHRENICS": 2 
JOEN FAGAN® AND GEORGE M. GUTHRIE 
The Pennsylvania State University 


Theories of schizophrenia, such as those of Sullivan“, Arieti®’, and Cam- 
eron®), emphasize the importance of a poor self concept and a distorted view of 
other persons as important factors in the development of schizophrenia. This study 
attempts to explore these theories by investigating the types of self-perceptions and 
the types of conceptions of the normal person held by schizophrenics. 

The Q technique has been used to investigate the perceptions of self, ideal self, 
and average other person in normal and neurotic subjects. However there have been 
almost no comparable studies with psychotic subjects. Chase“) found that the 
concepts of ideal self and average other person were similar for adjusted and mal- 
adjusted subjects, and only the self sorts could differentiate between the groups. 
There are a number of studies concerning the normal subject’s ability to appear 
abnormal, but little research on the psychotic’s understanding of normal functioning, 
other than that of Grayson and Olinger “) who asked psychotics to answer the MMPI 
as a well adjusted person would. In an unpublished study the second author found 
that chronic schizophrenics were not able to produce normal MMPI profiles when 
asked to sort the cards, ‘‘as though you were perfectly well and didn’t have any 
troubles.” 


1Adapted from a dissertation submitted in partial fulfillment of the requirements for the degree 
of Doctor of Philosophy, The Pennsylvania State University, 1958. The senior author wishes to thank 
committee members, Dr. William 8. Ray and Dr. Lester Guest and the Psychology Staff at the Leech 
Farm Veterans Hos: vital, Pittsburgh, where the data were collected. 

*Published with the permission of the Chief Medical Director, Department of Medicine and 
Surgery, Veterans Administration, who assumes no responsibility for the opinions expressed or con- 
clusions drawn by the authors. 

3N ow at the Mental Hygiene Clinic, Veterans Administration Regional Office, Atlanta, Georgia. 
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This study was concerned with the variety of self-perceptions found among 
schizophrenics and with their one or more conceptions of normality. This material 
should also shed light on the distortion of interpersonal relations characteristic of 
schizophrenia. Since it is often suggested that the schizophrenic has a marked de- 
ficiency in role playing skills, this method should indicate what roles he can take 
when asked to take the role of an average person. Extended further, his perception 
of normality may have considerable therapeutic and /or predictive utility. 


METHOD 


The method of transposed factor analysis was chosen to identify the patterns of 
self concepts within a group of patients. In order to select and structure items for the 
Q sort, an outline of ways of perceiving the self and other people, and ways of relating 
to others was formulated, based on Arieti’s and Sullivan’s theories. Descriptions of 
healthy ways of perceiving and relating were also included to make the sort more 
representative of a wide range of behavior. The outline indicating item content is 
given below: 


I. Conceptions of the Self 
A. Weak, unable to succeed or perform well, grandiose 
B. Bad, guilty, over-moralistic 
C. Confusion of identity, lack of control, over-control 
D. Able, strong, self-confident, good, stable 


II. Conceptions of Others 
Strong, demanding, controlling 
Rejecting, cold, distant 
Hostile, punitive 
Having bad motives and morals, unfair 
Confusing, ambivalent 
Good, accepting, warm 


III. Ways of Relating to Others 
A. Hostile, aggressive, rebellious 
B. Isolated, withdrawn, emotionally detached 
C. Dependent, compliant 
D. Able to form good, affectionate relationships 


One hundred and fifteen items developed within this framework were admin- 
istered twice to 23 hospitalized male schizophrenic veterans. The reliability of the 
item placements and the interpersonal variability of the placements were calculated, 
and 84 items, shown in Table A‘, were chosen for inclusion in the final Q sort. 

The subjects used in the final testing were 20 schizophrenic male veterans who 
were white native-born Americans with no known neurological damage and no recent 
insulin or electric shock treatments. They were given the items on consecutive 
days, with each subject being asked the first day to place the items into a seven 
point, approximately normal distribution ranging from very true of me to very false 
of me so as to describe himself. The following day he was asked to sort the items as 
he thought an average person would if asked to describe himself. The patients were 
shown a picture of a man of similar age and social class. Instructions were as follows: 
“‘Here is a picture of an average guy. Now I want you to pretend you’re him, and 
answer the statements the way you think the average person would answer them.”’ 
Normality in this study implies average rather than ideal, although some schizo- 
phrenics probably have a hard time differentiating these two concepts. The in- 


‘The two matrices of intercorrelations, the two centroid matrices, the two rotated matrices, the 
Q sort items with reliabilities and variabilities, and the items with the largest correlations with each 
factor have been deposited with the American Documentary Institute. Order Document No. 5735 
remitting $2.00 for microfilm or $3.75 for photocopies. 
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structions should be noted since there may be some differences of response depending 
on whether the patient is asked to sort ‘‘as if you were normal and well adjusted”’ or 
is asked to sort “the way an average person would.”’ There is clinical evidence that 
the schizophrenic has difficulty conceiving himself as different than he is. 


RESULTS AND DIscUssION 

A total of 190 Pearson product moment correlations was calculated from the 
20 self sorts and a similar matrix was obtained from the normal sorts. The two 
matrices were then factor analyzed. Five factors were extracted from the self matrix 
and four from the normal matrix, and rotated to simple structure. In order to inter- 
pret the factors, the rotated factor loadings were correlated with the subjects’ 
responses to each item. The items with the highest correlation with each factor were 
selected to aid in naming the factors, These results are shown in Tables B through I.‘ 


Factors derived from the Self Matrix. Subjects with high loadings on Factor I, des- 
cribe themselves only in positive or healthy ways, indicating that they feel self- 
confident about their abilities, that other people accept them, and that they are 
able to relate well to others. Doubts and rejection by others are denied. This factor 
seems to represent an attempt to deny illness. The most highly correlated items 
report that they make friends easily and that they expect to succeed. Although the 
behavior of subjects with high loadings on this factor belies their self-descriptions, 
their ability to cover up their illness indicates some ego strength. This factor seems 
to represent defensive denial of maladjustment. 

Factor II, is characterized by feelings of worthlessness and self-doubts, many 
guilt feelings, and an unstable or confused self concept. The most highly correlated 
item was, ‘If I think someone is mad at me, it worries me a lot.’”” However, other 
people are described as kindly and fair, and their acceptance and regard is very im- 
portant. The unfavorable description of the self, and the need to be helped and 
liked by others are similar to Arieti’s concept of the ‘“‘bad-me’’. This factor represents 
a concept of the self as bad and inadequate with dependency upon others. 

Factor III, is largely concerned with conceptions of self, with little reference to 
perceptions of or relationships with other people. Subjects with high loadings on 
this factor indicate that control of emotions and impulses is very important to them. 
They deny guilt, mistakes, and feelings of failure. An examination of the subjects 
with high loadings indicates that their behavior is at variance with their descriptions 
of themselves. This factor therefore seems to represent a denial of feelings, but in 
contrast to Factor I, most of the emphasis is on self-perception. Factor III, is des- 
cribed as denial of guilt and self-doubts. 

Factor IV, is characterized by maladjustment in every area. Failure is expect- 
ed, self-doubts are strongly indicated, other people are seen as rejecting and aloof, 
and relating to others is very difficult. This is exemplified in the item, ‘I have met 
many people who acted like they didn’t want to have anything to do with me.”’ This, 
of all the factors, is the most similar to the descriptions of the schizophrenic given 
by most theories, with a poor self concept and an inability to relate to other people. 
This factor clearly indicates extensive maladjustment. 

Factor V, gives a less clear picture. Most of the items highly correlated with 
this factor concern perception of other people and ways of relating. For the most 
part, others are seen as confusing and ambivalent, yet subjects with high weightings 
on this factor feel that they can relate easily. Their relationships do not appear to 
be on a deep emotional level, however. This factor can be tentatively named super- 
ficial relatedness with little understanding of other people. 


Of the five factors, two deny unhealthy aspects of the self, two admit consider- 
able maladjustment, and one gives a mixed picture. These descriptions are similar 
to the clinical impressions given by psychotics, some admitting their illness freely, 
and some being able to deny any trouble, although their behavior seems to indicate 
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many problems. On the whole, the factors obtained from the self sorts give person- 
ality descriptions that correspond to the theoretical formulations of Arieti. These 
results also have implications for self-report measures of schizophrenia such as the 
Sc scale of the MMPI, suggesting that there is a need for a minimum of five scales to 
measure schizophrenia. 


Factors Derived from the Normal Matrix. The results of the analysis of the normal 
sorts are shown in Tables J through P*. The items most highly correlated with 
Factor I, indicate that some schizophrenics describe the normal person as seeing 
other people as accepting and understanding. Relationships with others are gener- 
ally healthy, although there are indications of compliance and dependency. Almost 
nothing is indicated about the self-perception. The emphasis on relationships and 
submissiveness has considerable similarity to Fromm’s description of the marketing 
orientation? and also to Riesman’s concept of other-directedness“*) for which this 
factor is named. 

Subjects with a high loading on Factor II, perceive the normal person as feeling 
aggressive and self-assertive. Other people are seen as friendly and companionable, 
but there is no deep emotional attachment or involvement. This description of 
surface relatedness, with emphasis on self-asserting and dominance seems similar to 
Fromm’s exploitive orientation, and is named aggressive individualism. 

Factor IIL, describes the normal person as having much self-assurance, being 
able to relate to others easily and well, and generally enjoying life. This seems to be 
quite a healthy description with a good balance between strong self concept and 
confident dealings with other people, and can be called exuberant self-confidence. 

Factor IV, characterizes the normal person as having self-doubts, little con- 
fidence, and a lack of self-assertion. There is much compliance with the wishes of 
others and dependence upon them for guidance and direction. This factor is similar 
to Fromm’s receptive orientation, and is named timid dependency. 


The four factors derived from the schizophrenics’ descriptions of the normal 
person seem to have much similarity to cultural stereotypes of normalcy, and are 
also closely related to descriptions of personality types given by Fromm. Only 
Factor IV, seems to reflect the pathology of the subjects to any marked extent. It 
would seem that schizophrenics are able to verbalize socially accepted concepts con- 
cerning the ways a person should feel and behave. 


RELATIONSHIPS BETWEEN THE SELF CONCEPT AND DESCRIPTIONS 
or NoRMALITY 

The five factors obtained from the self sort were correlated with the four factors 
describing perceptions of normality to determine if subjects who described them- 
selves in similar ways tended to perceive the norma! person similarly. Of the twenty 
correlations between the five self factors and the four normal factors, only one was 
greater than .40, indicating that subjects who tended to describe themselves in 
similar ways did not view the normal person alike. The correlations between the 
self sort and the normal sort for each patient ranged from —.17 to .65, indicating 
that they did not perceive themselves as similar to their concept of the normal per- 
son. The schizophrenic does know and will report that he is different from the 
stereotypes of normalcy which he apparently shares with nonschizophrenics. 


SUMMARY 
The responses of 20 schizophrenic patients to a Q sort including items describing 
perceptions of self and other people and ways of relating to others were correlated 
inversely and factor analyzed, resulting in five factors: I, Defensive denial of mal- 
adjustment; II, Bad and inadequate with dependency upon others; III, Denial of 
guilt and self-doubts; IV, Extensive maladjustment; and V, Superficial relatedness 
with little understanding of other people. 
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Responses of the subjects using the Q sort to describe the average person were 
correlated and factor analyzed, and four factors were extracted: I, Other directed- 
ness; II, Aggressive individualism; III, Exuberant self-confidence; and IV, Timid 
dependency. 

Correlations between the factors obtained from the self and normal sorts 
indicated little relationship between the way patients described themselves and the 
way they viewed the normal person. Correlations between self and average person 
sorts for each patient indicate that these schizophrenics do perceive their discrep- 
ancy from what they believe to be normal. As with neurotics, schizophrenics differ 
most, not in their conception of the average other, but in their perception of them- 
selves. 
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PREFERENCE FOR ABSTRACT VERSUS CONCRETE WORD 
MEANINGS IN SCHIZOPHRENIC AND BRAIN-DAMAGED 
PATIENTS! 


NORMAN ALVIN MILGRAM? 
Boston University 


It is generally held that of the recognized cognitive abilities, vocabulary fund is 
least susceptible to disturbance and is, therefore, the most stable index of premorbid 
functioning in disturbed persons“: "1, It is also generally held that schizophrenic 
and brain-damaged patients differ from normals in the use and understanding of 
words. §. 12, 4, 15). There is, however, no necessary contradiction between these 
statements. Yacorzynski“®, Goldstein® »». "-"), Rapaport ® »- “) and others 
have suggested that the stability of word usage in clinical populations is more ap- 
parent than real. As Yacorzynski stated the issue, “It appears reasonable to assume 
that the reason why the vocabulary of the deteriorated individual remains relatively 
unchanged is because easier methods of reaching the same end results are left to the 
organism even if the more difficult solutions are no longer available.” “* ». 26) 
Several experimental studies °: *: 7) have confirmed ‘‘Yacorzynski’s hypothesis” and 
have shown that while gross word definitions are adequately retained, the more 
subtle, conceptual usages of words are impaired. 

The present study enlarges upon an earlier investigation by Flavell“) who 
found that in a special test situation in which S is asked to select the one of two choice 
words which is “closest in meaning” to a given stimulus word, normals selected 
words bearing an abstract or essential meaning relationship to the stimulus word 
more often than schizophrenics. Flavell’s investigation, like the majority of others 
in this area, dealt with the verbal performance of schizophrenics alone. An investiga- 


1This study was a portion of a doctoral dissertation submitted to Boston University, 1958. The 
writer is indebted to Drs. Berkeley, Goodglass and Weinberg for their st throughout the study. 
The writer is especially grateful to Dr. John Flavell whose original work was the stimulus for the 
present work. 

*Now at the Nebraska Psychiatric Institute, Omaha, Nebraska. 
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tion of the verbal performance of brain-damaged patients and a comparison of their 
performance with that of schizophrenics appear warranted both from empirical and 
theoretical points of view. 


MerTHoD 

The present study compares the preferences of schizophrenic, brain-damaged 
and normal Ss for abstract or essential versus concrete or non-essential word mean- 
ing relationships. Although the three groups will be equated on vocabulary fund, 
systematic differences in word usage are expected to emerge, 7.e., normals will select 
abstract meaning relationships more often than schizophrenics or brain-damaged 
patients. In Flavell’s study significant differences between schizophrenics and 
normals were found only for the total abstract score, but not for the separate ab- 
stract or concrete categories. Certain changes from Flavell’s procedure are being 
introduced in the present study to accentuate the preferential weight given each of 
the separate abstract and concrete word meaning categories by the three groups. 


Subjects. Three male groups consisting of 32 schizophrenics, 30 brain-damaged 
patients, and 20 normals were used. The criteria for selection of the schizophrenic 
group included (a) a psychiatric diagnosis of schizophrenia (paranoid, 24; hebe- 
phrenic, 2; catatonic, 2; and chronic, undifferentiated, 2) and (5) no neurological 
involvement, no somatic therapy (insulin coma or electric shock) within the past 
six months, and no history of lobotomy. 

The criteria for the selection of the brain-damaged group included (a) a neuro- 
logical diagnosis of damage to the cerebral cortex (vascular disease, 9; degenerative 
disease, 12; chronic brain syndrome, 9) and (b) no previous history of severe mental 
disturbance and no evidence of psychosis at time of testing. 

A final criterion for inclusion in any subject group was the willingness and 
ability of S to attend to instructions and to follow through with the demands of the 
testing situation (of which the data reported in the present paper are a small por- 
tion). Excluded from this study were litigious paranoid patients and mute catatonic 
patients who refused to cooperate; also excluded were grossly impaired brain- 
damaged patients, especially patients with severe aphasia. 

The two clinical groups were drawn from the psychiatric and neurological 
wards of two hospitals of the Veterans Administration. The normals were drawn 
from the medical wards of one hospital and from the general population outside the 
hospital. The three groups were equated on age and vocabulary fund as measured 
by the Vocabulary Subtest of the Wechsler-Bellevue; the vocabulary scores are re- 
ported as IQ scores. Data on age, years of formal education, and IQ are shown in 
Table 1. A statistically significant, but small, difference between normals and the 
two clinical groups in years of formal education was observed. 


Tass 1. Acs, IQ anp Years or ForMAL Epucation 








Group Age IQ Education 





N 12.0 


44.6 

(15.2) (2.7) 
8 43.1 10.8 
(13 .4) (3.2) 


45.6 11.5 
(15.2) (3.2) 














PROCEDURE 
This test (Fig. 1) was taken from Flavell“) with some modifications in pro- 
cedure and new word items, and consisted of 48 items each containing a stimulus 
word and two choice words. All the stimulus words were nouns; the choice words 
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Fic. 1. Worp Mranrina Test (Tae Asstract RESPONSES ARB ITALICIZED) 








TUNE melody(S) - sing(V) 24. oven(S) - hot(A) 
. ELECTRICITY Sees) - energy 25. canoe(SB) - row(V) 


) 26. VIOLIN strings(PW) - fiddle 
. PARCEL package(S) - wrapper (S) 
PW 27. FLOWER ene - rose(SB) 
. BOOK page(PW) - novel(SB) 28. WATER iquid(SP) - drink(V) 
SALT spice(SP) - sprinkle 29. EGGS hen(CC) - food(SP) 
(V) 30. KING crown(CC) - monarch 
HOBBY interesting(A) - 8 


(S) 
stamps(SB) . TREE owe - branch 
. HARBOR ships(CC) - bay(S) Ww) 
. HOUR — - minute . CAMERA eqn SP - lens 


) 
. RABBIT ears(PW) - bunny(S) . WEIGHT Soop - ton(SB) 
. KNIFE ao - blade . WINTER season(SP) - cold(A) 
(PW) 5. WAGON wheel(PW) - cart(S) 
. BLANKET warm(A) - cover(S) . CANDY gumdrop(SB) - sweet 
. DOG dle(SB) - bark(V) A 
. SICKNESS hospital(CC) - illness . STRING tie(V) - twine(S) 
(S) . BREAKFAST morning(CC) - meal 
parent(SP) - dear(A) (SP 


nest(CC) - robin(SB) . SALOON tavern(S) - noisy(A) 
steps(S) - climb(V) . RIVER tires -flow 


head(CC) - clothing (V) 
. WORLD globe(S) - round(A) 


OI 2D ar wO Ne 


(SP) 

. DREAM nightmare(SB) - sleep . BUREAU drawer(PW) -furni- 
CC ture(SP) 

. MOUNTAIN high(A) - Rockies(SB) . LAKE swim(V) - pond(S) 

. SHIRT garment(SP) - sleeve . POCKETBOOK purse(S) - money(CC) 
(PW) . MOON shine(V) - planet(SP) 

. SILK an - material . CONTAINER boz(SB) - top(PW) 


. GRANDMOTHER old(A) - relative(SP) 
. BACON meat(SP) - fry(V) 8. DIRECTION north(SB) - compass 
. AUTO drive(V) - Ford(SB) (CC) 





were nouns, adjectives, or verbs. On a given item, the stimulus word was related to 
one of the response words in one of three abstract ways and to the other response 
word in one of four concrete ways. The seven different types of relationship between 
stimulus and response words were as follows: 


(1) Synonym (Syn), e.g., “car” - “auto.” 

(2) Supraordinate (Supra), e.g., “car” - “vehicle.” 
(3) Subordinate (Sub) e.g., “car” - “Ford.” 

(4) Part-Whole (PW), e.g., ‘car’ - “fender.” 

(5) Adjective (A), e.g., “car” - “flashy.” 

(6) Verb (V), e.g., “car” - “drive.” 

(7) Concrete Context (CC), e.g., “car” - “garage.” 


Of the above seven categories, the first three are considered abstract, essential mean- 
ing relationships; the last four are concrete, nonessential meaning relationships. 
Each of the abstract categories was paired with each of the concrete categories four 
times, to make a total of 48 items. In a given item, the stimulus word is related 
abstractly to one choice word and concretely to the other.* 


‘This represents a departure from Flavell who paired each of the eight categories (he used a 
Homonym category which I discarded in the present study) to each of the other seven, e.g., some items 
were Syn-Supra and PW-CC. As a result in his study Ss chose between two abstract or two concrete 
response alternatives on over 40% of the items. In the present study Ss chose only between an ab- 
stract and a concrete response alternative on all items. This change was made to maximize the hy- 
pothesized deficit of the experimental groups in choosing responses from each of the separate abstract 
and concrete categories. 
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Subjects were tested individually and were instructed to read aloud the stimulus 
word and to select from the two choice words the ‘‘one that means most nearly the 
same” as the stimulus word. The maximum score for any one abstract category 
was 16; for a concrete category, 12. Thus, for example, if a S chose only abstractly 
related choice words, his abstract score was 48. 


RESULTS AND Discussion 
The results summarized in Table 2 confirm the hypothesis that normals select 
abstract word meaning relationships more than schizophrenics and brain-damaged 
patients. It is also observed that schizophrenics are significantly superior to the 


TABLE 2. MEANS AND STANDARD DEVIATIONS OF ScoRES OF ABSTRACT 
AND CONCRETE CATEGORIES 











oy P 
.| N > Schiz. Schiz. > BD 
< .01* < .001 


2 
5 


Category Normals 
Total Abstract Score 





& 8 


< .001 <.001 


M 
SD 
Synonym 


BS SB 


D 
Supraordinate ” 


< .05 < .02 


Sy 


i 


Nm Ft NO Kt We Kk lf 


Subordinate a < .05 < .02 


it et 
wo 


Part-Whole 


38 
RS 


Adjective 


M 
SD 
Verb M 
SD 
M 


Concrete Context <.01 


noe NWO NF NWO NO We Ww 


BS SR SB BR 
28 ¥8 83 


ee $8 Bs 


8 











*The ¢t Test with Snedecor’s Correction was used because the assumption of homogeneity was 
untenable for the eight comparisons made above. 


brain-damaged patients in this regard. The total mean scores for the three groups 
were 42.60, 38.38, and 31.74, respectively. With respect to the separate response 
categories, normals and schizophrenics chose significantly more abstract and fewer 
concrete words than did brain-damaged patients; significant differences between 
normals and schizophrenics were obtained on all but two categories, PW and CC. 
It is concluded that a person may retain the ability to summon up creditable defini- 
tions to words without retaining the ability to discriminate the abstract-concrete or 
essential-nonessential continuum of word meaning. This conclusion applies to both 
experimental groups, but more especially to the brain-damaged group. Only five of 
the 30 brain-damaged patients received scores as high or higher than the lowest 
score obtained by a normal; the overlap between schizophrenics and normals was 
considerably greater with 19 of 32 schizophrenics scoring higher than the lowest 
normal. 

Rank-order correlation coefficients were computed for the three groups between 
age and abstract scores and were significant only for the schizophrenic group (.30, 
significant at .05 level, one tail test). The explanation for this inverse relationship 
between verbal age and abstractive ability may lie in the fact that chronicity of illness 
and hospitalization in the schizophrenics are a direct function of age. The older 
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schizophrenics have had lengthy hospitalizations, while the younger schizophrenics 
have suffered from disturbances of relatively recent onset. Hence, the older, more 
chronically ill and deteriorated patients failed to do as well in the conceptual use of 
words as the younger, more acute patients. This explanation is consistent with 
Flavell’s finding of a positive relationship between abstractive ability and adequacy 
of everyday social interaction. While no measures of social interaction were made in 
this study, clinical observation showed the older, more chronically ill schizophrenics 
to be in less adequate social contact and interaction than the younger schizophrenics. 
Rabin, King, and Ehrmann”? have also showed that long-term schizophrenics were 
significantly lower than normals and short-term schizophrenics on three measures 
of vocabulary performance. 

Rank-order correlation coefficients between IQ and abstractive ability were also 
computed. These correlations were highly significant (.001 level) for the normals and 
the schizophrenics (.67 and .77 respectively), but were not for the brain-damaged 
group (.25). These findings further highlight the discrepancy in the brain-damaged 
group between the ability to retain the approximate definitions to words and the 
ability to use words conceptually. For the normals and schizophrenics, a high vocab- 
ulary score is generally associated with a high abstract score; for the brain-damaged 
patients a high score in vocabulary may be earned by a person with considerable 
deficit in cognitive abilities as a result of which he earns a low score on a verbal con- 
ceptual task. 

When a comparison is made of the response hierarchy of each group, it is noted 
that Syn, Supra, and Sub were chosen in descending order of frequency by the three 
groups, except that the difference between Syn and Supra did not reach significance 
for the two clinical groups. Whether this represents the hierarchy of meaning re- 
lationships in the general population of adult thinkers is unanswered. It should be 
noted, however, that Flavell observed the same hierarchy with his groups, although 
his items and scoring procedure‘ differed from those used in the present study. 

Turning to the concrete categories, no such hierarchy prevailed. V was chosen 
least frequently by normals, of the four concrete categories. A was chose most fre- 
quently by the two clinical groups and CC was chosen second in frequency to A by 
the brain-damaged group. All other differences were not significant. A possible ex- 
planation of the preference for A is that adjectives have a strong sensory stimulus 
value; normal adults are able to inhibit the tendency to respond to this stimulus and 
are able to follow the logically determined habits of word meaning; schizophrenics 
and brain-damaged patients are less able to inhibit this tendency, hence they are 
more likely to choose an adjective over an abstractly related word than they are to 
choose one of the other types of concretely related word categories. 


SUMMARY 


The present study compared the preferences of schizophrenics, brain-damaged 
patients, and normals for abstract or essential versus concrete or nonessential word 
meaning relationships. Although these three groups were presumably equated in 
ability for defining words (Vocabulary Subtest scores), both schizophrenic and brain- 
damaged patients chose fewer abstractly related words than normals. This disturb- 
ance in the abstract-concrete continuum of word meaning was greater and more 
consistently observed in the brain-damaged group than in the schizophrenic group. 
Results were interpreted as supporting ‘‘Yacorzynski’s hypothesis’. 
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ATTITUDES OF MOTHERS OF SCHIZOPHRENIC PATIENTS!? 
IRMA LEE SHEPHERD® AND GEORGE M. GUTHRIE 
The Pennsylvania State University 


Sullivan®®, Arieti®, and Fromm-Reichmann™ have emphasized the im- 
portance of the mother-child relationship in the development of schizophrenia. 
From case histories, interviews, and the reports of schizophrenic patients in therapy, 
these mothers have been variously described as overprotective, subtly and overtly 
rejecting, domineering, and as anxious and inadequate individuals who use their 
children to fulfill their own needs at the expense of the child’s development as a 
person in his own right ®: * ® 7. % 19 14, 12, 13,18) Such reports are helpful but limited 
in value by their overgeneralization and the theoretical biases of the investigator. 
Some of the studies give the impression there is only one particular pathological 
pattern of mother-child relationship. Other studies report differing types of relation- 
ships. In a particularly relevant study, Hotchkiss“, through direct observations of 
22 mothers of hospitalized single male schizophrenic patients, found five types of 
mothers: (a) oversolicitous, hovering, domineering; (b) sweet, indulgent; (c) seduct- 
ive, flirty; (d) intent, controlled, quietly purposeful; (e) removed, non-participating, 
ignoring. Since no single pattern has been proven, but rather several patterns ob- 
served, it seems desirable to identify as clearly as possible the various types of mother- 
ing that schizophrenic patients may have experienced. 

The purpose of this study is to investigate the attitudes of mothers of schizo- 
phrenics toward children and family life in order to delineate patterns of mother- 
hood which may be associated with the development of schizophrenia in children. 


1This article is based upon a dissertation submitted as partial fulfillment of the requirements for 
the degree of Doctor of philosophy at The Pennsylvania State University. 

*Published with the permission of the Chief Medical Director, Dept. of Medicine and Surgery, 
Veterans Administration, who assumes no responsibility for the opinions expressed or conclusions 
drawn by the authors. 

’Now with Pupil Personnel Services, Board of Education, Atlanta, Georgia. 
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A factor analytic approach is used since it offers an effective method of placing sub- 
jects into groups showing common attitudes, from which basic patterns of mother- 
hood can be seen and conceptualized. 


METHOD 

The instrument used in this study was developed from the Schaefer and Bell 
pool of items for research on parental attitudes“*. Items were selected which repre- 
sented a broad coverage of attitudes toward parent-child relationships and family 
life including the mother’s perception of herself and her role as wife and mother, her 
perception of and expectations of children, and her ways of relating to them. A 
pilot study was carried out by administering 130 items on two occasions to 17 moth- 
ers of male schizophrenics. From these 130 items, 100 which had the best reliability 
and interpersonal variability were chosen for inclusion in the final study. They 
are presented in Table A‘. 

The subjects were 20 white, American-born mothers of male schizophrenic 
patients in a Veterans Administration neuropsychiatric hospital. These women 
ranged in age from 42 to 65 years with a mean age of 54, and in education level from 
5th grade to one year of business school beyond high school. The mean grade level 
was 8.7. Estimated socio-economic level was lower middle and middle class. The 
subjects were asked to sort each item to indicate whether they strongly agreed, 
mildly agreed, mildly disagreed, or strongly disagreed with the content. 

Pearson r’s were computed between each pair of the 20 subjects. The 20 X 20 
matrix was factor analyzed by Thurstone’s centroid method ®® and the resulting 
arbitrary reference axes were rotated to simple structure. 


RESULTS AND DIscussION 


The rotated factor loadings on the five factors are shown in Table D‘. These 
loadings were correlated with each subject’s responses to the items and the fifteen 
items with the highest correlations with a factor were used for the interpretation of 
that factor. These are shown in Tables E through I.‘ 


Factor I: Detached authoritarianism. A subject with high loading on this factor was 
primarily concerned with herself as a mother with little emphasis on her child and 
her way of relating to him. She had a high level of anxiety and insecurity about the 
role of mother and perhaps even unconscious hostility toward the child. To assuage 
her anxiety she demanded perfection of herself in order to reach an exalted ideal of 
motherhood that must be constantly proven and reassured by the child’s complete 
loyalty, gratitude and esteem. She could not allow the child democratic considera- 
tion in the family since this would weaken her authoritarian role toward him. She 
related passively to the stronger authority figure of her husband even to the extent 
of failing to protect the child. In turn, she acquired a sense of self-esteem from the 
exaltation and obedience of the child. Such a relationship with the child who was 
used mainly to enhance the mother’s needs precluded any deep emotional involve- 
ment. This factor seems to resemble the authoritarian personality as well as the 
patterns of mothering found by Tietze “* in her study of the mothers of schizophren- 
ics, a the type Hotchkiss“ observed and named intent, controlled, quietly pur- 
poseful. 


Factor II: Inadequacy and inconsistency. This factor seems to describe the mother to 
whom motherhood was primarily a sacrifice of all her own enjoyment. She felt in- 
secure and ill-equipped for the responsibilities thrust upon her, and felt she got little 
help from her inconsiderate and selfish husband. She allowed herself few social con- 


‘The correlation matrix, the centroid factor matrix and the items have been deposited with the 
American Documentation Institute. Order Document No. 5736 from ADI Auxilary Publications 
Project, Photoduplication Service, Library of Congress, Washington 25, D. C., remitting in advance 
$2.50 for photoprints, or $1.75 for 35 mm. microfilm. Make checks payable to Chief, Photoduplication 
Service, Library of Congress. 
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tacts which in turn limited the early social experience of the child. Her perception 
of children was inconsistent as was her way oi relating to them. On the one hand, 
children, regardless of age, are thought of as their mother’s babies; on the other, 
they are seen as so willful and headstrong that the mother is unable to cope with 
them or set limits. Thus she dealt with the child inconsistently and ineffectually 
which could only be confusing to the child, and interpretable as subtle and covert 
rejection. The passivity of the father and the dominance of the mother are similar 
to descriptions given by Kohn and Claussen“, and Gerard and Siegel“. This 
factor seems similar to the mothers Hotchkiss described as sweet and indulgent ©. 


Factor III: Pervasive control. High weightings on this factor indicate a strong empha- 
sis by the subject on maternal control of every aspect of the child’s life. Her expecta- 
tions included early toilet training, wise use of his time, and respect for his parents 
above all other adults. She allowed him no privacy, even of thoughts. Aggressive- 
ness was suppressed and parents or teachers were to be depended upon to solve prob- 
lems. Her own embarassment and anxiety about sex would not permit her to answer 
her child’s questions. In contrast to her demands for the child’s achievement and 
conformity, her own needs tended to keep him dependent upon her even into adult- 
hood. She avoided tenderness and affection. She tended to feel quite competent in 
the mother role and probably expended considerable energy in controlling and direct- 
ing every aspect of the child’s life. This factor resembles the ‘“‘schizophrenogenic”’ 
pattern of motherhood described by Fromm-Reichmann®? and Arieti“; also the 
overprotectiveness factor described by Schaefer and Bell“®, and Levy®. Toa 
large extent it corroborates the maternal picture drawn in the Mark study“*, and 
the type Hotchkiss called oversolicitous, hovering, and demanding“. 


Factor IV: Sophisticated denial of inadequate mothering. The 15 most highly cor- 
related items were all negative. All difficulties in relating to and handling children 
were denied, as were marital conflict and feelings of inadequacy in the maternal roie. 
Investigation of the five subjects whose highest loadings were on Factor IV showed 
them to be above the mean of the group in education, and business and social exper- 
ience. In view of the findings, it is possible that these women had acquired a higher 
level of sophistication concerning approved child-rearing theories and attempted to 
give the right answers. 


Factor V: Annoyance and rejection. Factor V includes subjects who indicated strong 
feelings of dissatisfaction with the role of the mother. They were dependent, anxious 
individuals who were easily irritated by their children. Motherhood was perceived as 
a sacrifice which the children must repay by being considerate, grateful, and by de- 
manding as little as possible. To achieve this, children were discouraged from bother- 
ing their parents with their problems and troubles. Annoying behavior was dealt 
with inconsistently by harsh measures or by ignoring the child. In contrast to Factor 
III, these mothers did not foster dependency in the child by overprotection, but 
preferred that the child go his own way and not annoy them. Their lack of concern 
was emphasized in their ability to deny feelings of guilt if the child were hurt acci- 
dentally. Sex was a disturbing factor to these mothers who regard sex play in children 
tantamount to becoming adult sex criminals. Most of the ways of relating described 
by this factor imply annoyance and rejection. This factor resembles the pattern of 
maternal rejection described by Symonds“* and Schaefer and Bell’s factor named 
Hostile Rejection of the Homemaking Role“, and the type Hotchkiss described as 
removed, non-participating ©. 

The parallel between these findings and those of Hotchkiss “) warrants further 
comment. Four types are common to the results of both studies. Hotchkiss found 
only one seductive and flirty mother. Factor IV in this study has no parallel in the 
other since the methods of data collection were different. This similarity of results 
between inverse factor analytic methods and traditional clinical-observational pro- 
cedures is rather convincing evidence that some of the subtleties of dynamics are not 
lost in a calculating machine. 
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SUMMARY 


One hundred items on attitudes toward children and family life were admin- 
istered to 20 mothers of hospitalized male schizophrenics. The correlations between 
the persons were factor analyzed, and orthogonal rotation revealed five factors 
identified as Detached Authoritarianism, Inadequacy and Inconsistency, Pervasive 
Control, Sophisticated Denial of Inadequate Mothering, and Annoyance and Re- 
jection. It is felt that the results of this study indicate the probability of several 
“‘schizophrenogenic” patterns of motherhood. 
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PERCEPTION OF SUCCESS OR FAILURE IN PICTURES OF OTHERS! 


ROBERT ROSENTHAL 
University of North Dakota 


Jones has remarked that ‘“‘appraisal of people is one of our commonest social 
activities.”’®- ». 8) It is also one of the most important activities since we make 
countless decisions both trivial and crucial on the basis of these appraisals. The 
purpose of this study is to examine the effect of level of aspiration, of being a college 
male, a college female, or a paranoid schizophrenic on the process of perceiving others 
as more or less successful or unsuccessful as judged from photographs. 


METHOD 

Ss included 18 college men and 18 college women drawn from a class in intro- 
ductory psychology and 18 hospitalized male paranoid schizophrenics. The Cassel 
Group Level of Aspiration Test “’ was administered to all Ss and on the basis of their 
mean (aspiration) discrepancy scores, Ss were called high aspirers if they ranked 
among the nine high scorers for their group. The nine low scorers became the low 
aspirers. 

Each 8 was then shown a series of photographs and told that the people pictured 
had been experiencing success or failure. Ss were asked to rate each photo for degree 
of success or failure experienced on a rating scale which ran from +10 very successful 
to —10 very unsuccessful. The photos themselves were selected for relative neutrality 
from old issues of a weekly news magazine. The analysis was then replicated with 54 
different Ss, 18 in each group, using a different set of 14 photos. The score for each 
S and the data going into the analysis of variance was the sunrof his 14 ratings. 


RESULTS 

There was no difference in either analysis in the perception of the success or 
failure of others between the high and low aspirers. In both analyses however there 
was a significant difference among the groups (F = 11.8, p = .001; F = 5.03, 
P = .02) for the two analyses respectively. In both analyses the college men and 
college women did not differ significantly from each other but both differed sig- 
nificantly (Ps = .05, .01, .01, .01) from the paranoid patients who tended to see 
others as relatively more successful. Furthermore, the paranoid group tended to 
rate somewhat more extremely than did the college population. This was suggested 
by the Fs which were based on unbiased estimates of the population variances; the 
p levels were based on the two tailed test, and reached an adequate level of con- 
fidence (.002) in only one of the analyses. 

Although the aspiration effect was not found significant in either analysis of 
variance, it appeared to E while the paranoid Ss were being tested that those who 
scored high mean discrepancy scores tended to rate the photos as more successful 
than did the lower aspiring patients. Using z transformations the correlations for 
each of the three groups were averaged for the two analyses. For the 36 paranoid 
patients, the r = +.54 was significant at the .01 level. The correlations (r) of -.08 
and —.25 for the 36 college men and the 36 college women respectively, were not 
significant. 

Miller“ found that the (level of aspiration) mean discrepancy scores of para- 
noid schizophrenic patients did not differ significantly from normals’ scores. Our 
results confirmed this finding. The mean of the mean discrepancy scores for the 
paranoids was +2.37 (SD = 6.31), for the college men the mean was +1.04 (SD = 
1.72) while for the college women the mean was +1.11 (SD = 1.74). None of the 
t’s were found to be significant. Jost“) found more variability in aspiration measures 


1This study is a fresh analysis of a portion of the data collected for the doctoral dissertation pre- 
ared under the guidance of Professors B. Klopfer and J. A. Gengerelli, and submitted in partial ful- 
ent of the requirements for the Ph.D. degree, U. C. L. A., 1956. 
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among his schizophrenic Ss than among his normals and this finding too seems con- 
firmed by our results. The estimate of the variance of the paranoid group was more 
than 13 times the estimates of the variances of both the college groups. With 35 and 
35 df this F was significant at well beyond the .001 level. The range of the mean 
discrepancy scores for both groups of normals combined was only 10.56 (-3.44 to 
+7.11) while for the paranoids it was 34.40 (—2.38 to +32.11). Our findings differ 
with Jost’s in that none of our paranoid Ss scored a lower mean discrepancy score 
than the lowest normal, while in his study there were more extreme negative than 
extreme positive scores. This difference may have been due to the fact that Jost’s 
group was comprised of different types of schizophrenics while our group was com- 
prised solely of paranoids. 


DIscussion 


In the light of the very deep feelings of failure usually experienced by paranoid 
schizophrenics, the finding that these patients should see others as being more suc- 
cessful than normals see them, may be interpreted as a kind of prevailing attitude 
that everyone must be more successful and adequate than themselves. The signifi- 
cant correlation for the paranoids between level of aspiration and perceiving others 
as relatively more successful might be interpreted in the same vein. The paranoid 
schizophrenic who aspires unusually highly is doomed to even greater failure by his 
own unrealistic standards. This should increase the likelihood that he would tend to 
see other people as far more successful than himself. This interpretation seems 
plausible but is at variance with Rotter’s® notion that lower aspiration level is more 
often associated with feelings of inferiority. Since Rotter did not use schizophrenic 
Ss in his study it is possible that his finding obtains with various non-schizophrenic 
groups but not for paranoid schizophrenic ones. Finally, confirming Miller’s and 
Jost’s findings may to some extent soften Sutcliffe’s“ very appropriate concern 
over the inconsistency found in this area of research. 


SUMMARY 


Three groups of Ss, 18 college men, 18 college women, and 18 hospitalized male 
paranoid schizophrenics were each divided into high and low aspiration levels on the 
basis of scores on the Cassel Group Level of Aspiration Test. All Ss were then asked 
to rate the degree of success or failure expressed by the people pictured in 14 photo- 
graphs selected for this purpose. The design was replicated using new Ss as well as 
new photographs. Paranoid schizophrenics rated the photos as significantly more 
successful than did normals and also showed greater variability of ratings than the 
normals. In general the higher the level of aspiration of the paranoid group, the 
greater the success they attributed to the people pictured. Miller’s finding that para- 
noid patients do not differ from normals in their mean discrepancy scores was con- 
firmed. Jost’s finding that schizophrenic Ss show greater variability of mean dis- 
crepancy scores was also confirmed, although all of our extreme aspirers aspired high 
while most of Jost’s extreme aspirers aspired low. 
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MULTIDISCIPLINARY VIEWS ON THE PREPARATION OF WRITTEN 
CLINICAL PSYCHOLOGICAL REPORTS: I. SPONTANEOUS 
SUGGESTIONS FOR CONTENT 


NORMAN TALLENT AND WILLIAM J. REISS 


Veterans Administration Center, Kecoughtan, Virginia Veterans Administration Center 
and The George Washington University Kecoughtan, Virginia 


INTRODUCTION 

The clinical psychologist working in the area of psychodiagnostics is perennially 
faced with the task of deciding what data he should seek out from his client and com- 
municate to those who will read his written evaluation. Several authors": ?:*: *) have 
published outlines of the sort of content the psychologist should impart to the reader 
of his reports, and psychologists in a number of settings have constructed guides to 
help the staff psychologist, and particularly the trainee, cope with this problem. In 
the investigation reported here, opinions were solicited from a large number of 
psychologists and consumers of psychological reports on what constitutes appro- 
priate content for written psychological evaluations. In this way it was thought 
that important preliminary information might be gained that would be helpful in 
the writing of more useful psychological reports. 


PROCEDURE 

Source of Data. The information utilized in this study was obtained from a 
sample of psychiatrists, psychiatric social workers and clinical psychologists em- 
ployed by the Veterans Administration. In the context of a personally addressed, 
printed form letter psychiatrists and social workers were asked “‘ . . . what sort of 
specific information do you want to get out of a psycho.ogical report?’”’ Psychologists 
were asked “. . .what sort of specific information do you think a psychiatrist or a 
social worker wants to read?”’ Potential respondents to these questions were selected 
without bias from a current, virtually complete VA roster of the above three profess- 
ions. Altogether 375 subjects, 125 in each category, were solicited. Of these, replies 
were received from 71 psychologists, 46 social workers, and 43 psychiatrists. 

Classification of Responses. The data were jointly classified by the writers into 
main categories and subcategories. As a simple check on the reliability of classifica- 
tion, the following test was performed: The writers first selected from the list of 
responses submitted by the psychologists all items which appeared to relate to in- 
tellectual function. By agreement, these were classified into nine subcategories. 
Then 52 responses appearing to pertain to intelligence which were submitted by the 
psychiatrists, and 24 such responses submitted by the social workers were similarly 
obtained and two weeks later these were independently sorted by the writers into 
the above nine subcategories. The agreement level between investigators was 91%. 


RgEsuLTS AND Discussion 

A diversity of topics, 88 in number, were offered by the respondents as proper 
content for psychological reports (tabled data). Some of the items which all three 
professions suggested with relatively high frequency will be recognized as common 
features of psychological reports. Included are the ‘‘personality picture’’, defenses, 
interpersonal relations, drives and /or dynamics, “ego strength’’, current intellectual 
status, assets, and diagnosis. Examples of content items mentioned, particularly 
by the psychiatrists and psychologists, were intellectual deficit, specific aspects of 
intellectual functioning, description of the psychopathological process, presence of 
organicity, and the ability to utilize treatment. The psychologists and social workers 
favored such topics as conflicts, adjustment, assets, relationship to significant life 
figures, perception of self and others, suicide potential and prognosis. 

On some items there was considerable difference of opinion. The psychiatrists 
and social workers appear to be much more interested in having a statement of IQ 
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TaBLE 1. RESPONDENTS’ SUGGESTIONS FOR APPROPRIATE CONTENT FOR 
PsycHOLoGiIcAL REPORTS 





Psychiatrists Psychologists Social Workers 
Content Categories N = 43 N =71 N = 46 





BEHAVIOR DESCRIPTION 
. Examiner—patient interaction 
A. General 
B. Cooperative—uncooperative 
. Appearance 
% -— vioral observations 
= Saneent (not further specified) 
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C. Speech disturbances 

D. Psychomotor disturbances 
E. Anxiety-tension 

F. Test response behavior 


Impressions (Interpreted Observations) 
. Clinical impression 
. Test taking attitude 
. Interest 
. Hostility 
. Persistence 
. Suspicion 

INTELLECTUAL Factors 
. Assets 

A. General 

> Optimal functionin 

= a intellectual features 

. Liabilities 

A. General 

B. Native low intelligence 

C. Deficit 
. Current intellectual status 
. Specific aspects of intellectual functioning 
. Orderliness of thought processes 
. Relation of intelligence to personality 
. Premorbid intelligence 
. 1.Q., intelligence, intelligence level 

Miscellaneous 
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PERSONALITY Factors 
. Personality picture (not further specified) 
. Psychopathology 
A. Description of pepepetniedeat process 
B. Areas of disturbed functioning 
a Behavioral symptomatology 
D. Etiological information 
E. Thinking characteristics 
F. Reality contacts 
. Underlying psychological processes 
(not further specified) 
. Conflicts 
. Adjustment variables 
A. Adjustment 
B. 
C. 
D. Defenses 
. Drives, dynamics 
‘A. Drives and/or dynamics (not further specified) 
B Dependency-independency 
C. Sexual role—identification 
D. Interest areas 
E. Goals 
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TasLe 1—continued 





Psychiatrists Psychologists Social Workers 
Content Categories N = 43 N = 71 N = 46 





PrrRsoNALITY Factors (continued) 
. Personality assets and liabilities 

A. Assets (not further specified) 

B. Ego strength 

C. Liabilities (not further specified) 
. Social variables 

A. Interpersonal relations 

B. Relationship to significant life fi 
. Perception of self (self-cuncept) and others 
. Miscellaneous emotional behaviors 
. Miscellaneous personality variables 


= — 8 
NOOO wwe 


Psycnop1aGnostic Data 
. Organicity 
A. Presence of organicity 
B. Extent of damage 
C. Areas of damage 
. Presence of overt psychosis 
. Presence of underlying psychosis 
. Differential diagnosis 


Diagnosis 
A. General 
ra Severity 
C. Stage of pathology 


PREDICTIONS 


. Outlook 

A. General 

B. Change (improvement) 

C. Change (retrogression) 

D. Interpersonal relations 
. Treatment prospects 

A. Ability to utilize treatment 

B. Information to assist therapist 
. Acting-out potential 
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D. Sexual deviancy 
E. Violence (assaultive behavior towards self 
and others) 
. Vocational—educational outlook 
. Prognosis 
. Prognosis with therapy 
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RECOMMENDATIONS 
. Recommendations (not further specified) 
. Recommendations re: treatment 
A. General 
B. Psychotherapy 
C. Environmental manipulation 
Recommendations for further study 


MISCELLANEOUS 
. Reason for and/or source of referral 
. Identification of tests 
. Case history data 
. Summary and conclusions 
. Other 





*Percent of N expressing these suggestions. 
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or intelligence level than the psychologists are in reporting it. To an appreciable 
extent the consumers of psychological reports apparently continue to see the psy- 
chologist in his older role of psychometrician or “IQ tester’, a role from which the 
latter appears to be backing off with a vengeance. Quite a few psychologists, how- 
ever, feel it proper to write about intellectual status and specific aspects of intellectual 
functioning. 

Recommendations regarding treatment, the item of content which the psycho- 
logist wishes most to convey, is spontaneously mentioned by a relatively small 
number of psychiatrists (psychologists: 74% vs. psychiatrists: 14%). However, half 
of the social workers in the sample suggested the desirability of receiving psychologi- 
cal recommendations. Thirty per cent of the psychologists indicated they should 
convey information to assist the therapist, but only 7% of the psychiatrists and 4% 
of the social workers spontaneously expressed a need for such information. 

The psychiatrists, much more so than is true of psychologists and social workers, 
expressed interest in learning from psychological reports about the orderliness of 
thought processes and about the presence of organicity. This interest in organicity, 
perhaps, is in keeping with the medical or basically organic background of the 
psychiatrist, but might reflect also a feeling on his part that psychological tests are 
indeed helpful indicators of organicity. 

Relatively little general concern is shown with pathology, which is perhaps a bit 
surprising since all of the respondents were drawn from psychiatric settings. More 
interest is placed on the overall personality picture. Nor is there an emphasis on 
learning of the presence of an underlying psychosis, or, for that matter, of the specific 
mange psychological processes even though psychological tests are reputed to 

go deep”’. 

Certain other contemporary approaches to the recording of the personality 
also received little encouragement from the results of this study. Thus there is 
relatively low representation of items such as discussions of the clinical impression, 
the patient’s affect, optimal intellectual functioning, etiology, the patient’s de- 
pendency-independency status, his goals, and case history data. Is it possible that 
the psychologist, for some reason or other, feels under compulsion to include in his 
report certain matters which no one, including himself, cares much about? 

Beyond the immediate issue of the kind of material which constitutes proper 
content for psychological reports, aspects of the question of role and role relation- 
ships in the mental health professions are touched upon by the data, but certainly 
not answered. Is it the psychologist’s role, for example, primarily to report raw 
scores, such as IQ’s — “‘the facts’? — as some of the respondents put it, or should 
he be more concerned with interpretation and recommendation? 


SUMMARY 


Psychologists, psychiatrists and social workers were asked to supply opinions 
on the kinds of material which constitute proper content for psychological reports in 
a psychiatric setting. The findings give some suggestions concerning what members 
of these professions tend to consider as important topics for clinical evaluation. The 
data may be further interpreted as a challenge to some of the content commonly 
found in psychological reports. 


REFERENCES 


1. Hamwonn, K. R. and ALLEN, J. M. Writing clinical reports. New York: Prentice-Hall, 1953. 

2. Kuoprer, B., Ainsworts, M. O., Kioprer, W. G., and Hour, R. R. Developments in the Ror- 
schach technique. Vol. 1. Technique and Theory. Ch. 17, Principles of report writing. Yonkers-on- 
Hudson, New York: World Book, 1954. 

3. Taytor, J. L. and Te1cuer, A. A clinical approach to reporting psychological test data. J. clin. 
Psychol., 1946, 2, 323-332. 

_ F. C. A new outline for psychological report writing. J. clin. Psychol., 1956, 12, 





THE RELATIONSHIP OF THE H-T-P TO A SOCIOMETRIC 
EVALUATION OF A GROUP OF PRIMARY GRADE SCHOOL CHILDREN 
IN DETERMINING THE DEGREE OF SOCIAL ACCEPTANCE* 


RITA G. ORGEL 
Merrick (N. Y.) Elementary Schools 


PROBLEM 


This study assesses the relationship of the House-Tree-Person drawing tech- 
nique to a sociometric survey of the popularity of children within a primary grade 
class in school. Our hypothesis was that there would be a positive correlation be- 
tween the degree of social acceptance demonstrated by means of a sociogram of the 
children’s peer group relationships and their rated social adjustment as tapped by 
the H-T-P test. 

Tolor and Tolor“) compared children’s popularity from both human figure 
drawings and sociometric choices. Four of five psychologist judges for the human 
figure drawings were able to distinguish successfully between the more popular and 
less popular children as indicated on the sociogram. Their findings demonstrated 
that the more popular subjects possessed more desirable personal traits on the draw- 
ings than did the less popular children ; however, they did find a slight overlapping of 
desirable traits present in all degrees of social acceptance. 


METHOD 


The experimental population consisted of 32 white children, 17 boys and 15 
girls, in Grade 2 in a Public School from middle class socio-economic home environ- 
ments in a suburban Long Island community. The age range was from 7 years to 
7-11, mean 7-4. 

In the initial phase of the experiment a sociometric questionnaire concerning the 
peer group community’s preferences and rejection patterns was administered and 
diagrammed. The students of the group were supplied with paper and pencils and 
told to make three choices among their peers concerning whom they would most like 
to sit with in first, second and third choice order. They were also instructed to write 
the name of the child in the group that the subject did not desire to sit next to, and 
include it at the bottom of the preference list. These data were assembled and ana- 
lyzed in the sociogram. 

The second or projective phase was the administration and checklist scoring 
of the H-T-P. Standard H-T-P Drawing Forms and No. 2 pencils were distributed 
to the group with the request that they draw the best house, tree and person that 
they could. No supplementary instructions were offered. Queries made by the 
group, such as, “What kind of tree?’’, ““Whose house?”’, and questions concerning the 
sex of the person to be drawn were answered by the response, “‘Do as you see fit,” by 
the examiner. 

From Buck’s“? qualitative evaluation procedures, a checklist was devised, for 
the objective judging of each examinee’s popularity potential. The selection of the 
checklist items was based upon the assumption that certain aspects of perspective, 
proportion, and detailing, as well as omissions and distortions in the drawing reflect 
the child’s perception of and interaction with the environment around him. 

These reproductions were then evaluated by an impartial judge, using a 37 
point checklist.' Each item on the checklist was assigned a mathematical rating of 
one credit. The drawings then received a numerical rating equivalent to the total 
number of scorable points for the drawing in question. Each pupil achieved four 


*The author wishes to [ee her gratitude to Miss Selma Landisberg who acted as an impartial 


judge in this study and offered both constructive criticism and encouragement. 
1Copies of the check list are available from the author. 
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scores, one for the house, the second for the tree, the third for the person and the 
fourth for the sum total of all three drawings. These rated scores were regarded as 
measures of social adjustment for each of the children tested. Garrett’s® formula 
for rho was utilized. 

The scores on the drawings were placed in rank order, i.e., the child receiving 
the highest score was placed first on the list achieving the rank order place of one; 
the child with the lowest score was placed last achieving the rank order place of 
—e Whenever a duplicate score was achieved a mid-score was assigned to 

oth. 


RESULTS 


The correlation between social status as revealed by the sociogram with the 
total checklist score derived from the sums of the scores of all three drawings is 
.271, not significant at the .05 level of confidence. The correlation between social 
status as revealed by the sociogram with the degree of adjustment as obtained from 
the checklist score of the ‘““House’”’ drawing is .378, significant at the .05 level of 
confidence with 30 degrees of freedom. The correlation between the charted social 
status on the sociogram and the ‘‘Tree ’ drawing score is —.173, not significant at the 
.05 level. The correlation between sociometric status of the group with the ‘‘Person”’ 
drawing score is .344, significant at the .05 level, with 30 degrees of freedom. 

The question arises as to why two phases of the triad, 7.e., the House and Person 
correlate and not the third, the Tree. Buck believes that the Tree appears to be the 
best subconscious portrait of the self of the three wholes, seeming to arouse the least 
conscious associations and less ego-defensive action on the part of the subject. @ »- ©) 
The sociometric choices were relatively close to the conscious level. The “Person” 
and the ‘‘House’’, as more easily identified with, are also close to the emotional con- 
scious level. The ‘‘Tree’’, however, hypothesized as a more subconscious projected 
self-portrait is less likely to correlate positively with the sociogram which taps feel- 
ings that are closer to the surface. 


SUMMARY AND CONCLUSIONS 


A sociometric questionnaire (used as a measure of personal popularity) and the 
House-Tree-Person Test were administered to 32 white children in a primary grade 
in a public elementary school situated in a suburban Long Island community. The 
H-T-P drawings were evaluated by an impartial judge experienced in the H-T-P 
technique using a checklist of 37 items devised to evaluate variables of social ad- 
justment. 

It was found that the overall H-T-P drawing scores showed a correlation of .271 
with the sociometric evaluation, not significant at the .05 level of confidence. The 
“House” and ‘‘Person’”’ drawings correlate with the sociometric evaluation, and are 
significant at the .05 level of confidence. The ‘““Tree’”’ showed a negative correlation 
(-.173). 
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PROBLEM 


There have been a number of studies of the ‘‘delay function of the ego” in the 
recent literature. Measures of motor and cognitive inhibition have been shown to be 
correlated ®). Both of these measures are significantly correlated with Rorschach 
M, “  ™ and Rorschach M has also been related to the ability to inhibit the ex- 
pression of an affect). Levine and Spivack “ have demonstrated that the ability 
to forego immediate gratification for a more distant goal is related to time sense, and 
time sense has been shown to relate with specific measures of delay functioning “*: '*), 

Rapaport, Gill and Schafer“*), and Fromm, Hartman and Marschak® from 
the psychoanalytic viewpoint have suggested that intelligence test performance may 
be conceptualized in terms of ego functions. Utilizing this approach, Levine, Glass 
and Meltzoff “) have shown in outpatient adults a correlation of a specific error made 
in the Wechsler-Bellevue (reversal of the mirror image N of the Digit Symbol sub- 
test) with Rorschach M and a measure of cognitive inhibition ability. Levine, 
Spivack and Wight? essentially confirmed the above findings in a group of psychia- 
trically hospitalized adults, but the findings were less clear cut in a group of emotion- 
ally disturbed adolescents. 

Only one study has attempted to directly relate a specific measure of delay 
function to IQ. Working with schizophrenics, Singer, Wilensky and McCraven“® 
found no significant rel.tionship between IQ and motor inhibition ability and accur- 
acy of time estimation. However other findings of this study appear somewhat out 
of keeping with other work. They found no correlation between M and IQ, and ina 
subsequent study Levine, Spivack and Wight? did find a significant relationship 
between M and IQ in a population of schizophrenics. They also failed to extract an 
intellectual factor from their matrix of intercorrelations. Three other factor analytic 
rim one dealing with Rorschach and intelligence test data have extracted such a 

actor“: % 18), 

The evidence suggests that measures of inhibition and time sense may bear 
some relationship to general intelligence. The present study tests this hypothesis 
by relating various tasks which have been used to establish the construct validity of 
the delay function of the ego, to a measure of genera! intelligence. 


METHOD 
Subjects. The experimental group consisted of 47 males, ranging in age from 11 to 19, 
with a median age of 15 years, 7 months. The IQs ranged from 83 to 134, with a mean 
of 105. The Ss were all in residence at a treatment center for emotionally disturbed 
children. None of the Ss were or had been psychotic, and none had known brain 
injury. The diagnoses were mainly of personality and neurotic disorders. 


Procedures. The experimental situation was presented to S as an attempt to develop 
some new tests. S was told that none of the results would become a part of his 
permanent record, in order to minimize any resistance to completing the procedures. 


Time conception. This task consisted of 21 questions, 10 of which asked what age S would be 
when a usually anticipated event (e.g. achieve success, retire, own own home, etc.) would occur in 
life. Eleven items cael S what year certain objective events might occur (e.g. trip to the moon, 
cure for cancer, run a 314 minute mile, etc.). The items were written by the present authors, but 
modeled after a similar procedure reported by Wallace“’). It was hypothesized that the greater 
the extension of time into the future, the more the ability to inhibit both motorically and cog- 
nitively, and the higher S’s IQ. 

Time estimation. This task called for S to indicate by saying “Stop” when he thought that a 
given time period had elapsed. Fifteen, 30 and 60 second time intervals were used. It was 
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hypothesized that the longer S waited before saying “Stop’’, the more ability to inhibit both 
motorically and cognitively, and tue higher S’s I 

Cognitive inhibition (CIT). A list of ten easy paired associates was read to S. After the asso- 
ciates were learned to a criterion of one perfect recitation, S was asked to nd upon presenta- 
tion of the stimulus word with any word other than the learned associate. CIT was taken as the 
average time interval between the pore | of the stimulus and the response for the ten pairs. 
Since this time is presumably hook up in part b a the process of finding a new associate, a measure 
of word association time (WAT) was obtained, either following or preceding an interval filled 
with other tasks. WAT was ouelling | from the mean response time to a list of ten other words. 
It was hypothesized that, contr for free association time, the shorter the CIT, the better 
the motor inhibition and the higher s 8 IQ. 

Motor Inhibition (MIT). This task required that S write the phrase “New Jersey Chamber 
of Commerce’’, between two lines, one half inch apart, no letter to be larger than the space be- 
tween the two lines. S was instructed to write as slowly as possible without stopping, and without 
taking the pencil off the paper. These instructions were repeated throughout the task. It was 
aman ew at that the longer the time taken to complete the phrase, the higher the IQ. 

Intelligence. The measure of intelligence utilized was the full scale Wechsler-Bellevue IQ. 
The IQ scores were taken from clinical files. The IQs were generally not more than one year old. 


RESULTS 


Considering that chronological age may be correlated with one or more of the 
tasks, and would consequently have to be controlled, the relationship of age and the 
other measures was examined by means of chi-square tests. Age was found to be 
unrelated to both measures of inhibition, and to 1Q. It was significantly and posi- 
tively related to time estimation (X? = 7.7; p = .01), and there was a tendency to- 
ward significance when related to time conception (X? = 2.58; p = .13). Older Ss 
extended themselves further into the future than did younger Ss. 

The relationship between motor inhibition and IQ was examined by U test. 
The distribution of motor inhibition times was split at the median (five minutes of 
writing) and the above and below median groups compared for 1Q. The mean IQ 
for good inhibitors was 107, and for the poor inhibitors, 99.8. This difference yielded 
a p value of .06. 

The relationship between cognitive inhibition and IQ was tested by chi-square, 
with the effect of free word association time partialled out by the roving median 
technique®?. The chi-square was 12.6, p = .001 for 1 df. The mean IQ of the better 
cognitive inhibitors was 109, and for the poorer inhibitors, 97. 

The roving median chi-square technique was employed in relating both time 
conception and time estimation to IQ, in order to hold age constant. The measure 
of time estimation used was the total of the three time judgments made to the three 
intervals employed. In order to obtain one measure of time conception, each S’s 
responses to the 10 “‘life’’ questions were summed, and the 47 Ss ranked according 
to these sums. Similarly, each Ss responses to the 11 ‘“‘year” items were summed, 
and the 47 Ss ranked according to these sums. The score for each S was then taken 
as the sum of his two ranks. 

There was a significant relationship between time conception and IQ (X? = 
7.86; p = .01), Ss with greater extension of time into the future having higher IQs 
(M = 109) than Ss with less extension into the future (M = 101). The relationship 
between time estimation and IQ tended toward significance (X? = 2.58; p = .13), 
Ss who waited longer before reporting the time intervals were up tending to have 
higher IQs (M = 107) than those who reported more quickly (M = 104). 

Since both measures of inhibition, and the measures of time estimation and time 
conception were taken as different measures of the delay function of the ego, the 
interrelationships between these measures were also tested. All these tests utilized 
the roving median chi-square, with appropriate control for age or free word associa- 
tion time as the variables demanded. To our surprise, time estimation and time con- 
ception measures were found to be unrelated, just as motor inhibition ability was 
found to be unrelated to cognitive inhibition ability. However, motor inhibition 
ability was significantly related to time estimation (X? = 7.70; p = .01), though not 
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to time conception, and cognitive inhibition ability was significantly related to time 
conception (X? = 4.77; p = .03), and not related to time estimation. 


DIscussION 


The major findings support the general hypothesis that measures of delay 
functioning of the ego as defined by the tasks employed bear a relationship to general 
intelligence. These findings supplement prior findings with adults. The fact that not 
all the measures used intercorrelated, despite their relationship to IQ, suggests that 
behavior on these tasks does not reflect a single dimension of ego delay as had been 
assumed. Singer, et al.“* suggest that there is both a motor and cognitive aspect to 
delay. There may also be developmental differences in the rate of growth of different 
techniques of delay. As in prior work with adults“: 7, the next step would appear 
to be the study of specific details of intelligence test behavior which may correlate 
with different measures of delay. The authors are presently pursuing such issues. 


SUMMARY 


The purpose of the present study was to demonstrate a relationship between 
various measures of ego delaying ability and intelligence in adolescents. Measures of 
delay such as motor inhibition, cognitive inhibition, time estimation, and time con- 
ception were all related to Wechsler Bellevue 1Q, but these measures did not all 
correlate among themselves. The results were interpreted as supporting the general 
view that the adequacy of the “delay function of the ego” bears some relationship 
to measures of general intelligence. 
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THE USE OF LEARNING THEORY IN THE PSYCHOLOGICAL 
TREATMENT OF CHILDREN 
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INTRODUCTION 


For some years there has been growing dissatisfaction amongst British clinical 
psychologists with psychoanalytically orientated psychotherapies. On the other 
hand, there has been an increasing desire to derive treatment procedures from the 
body of general psychological theory, in particular learning theory, which will allow 
of more precise formulations of problem and predictions of outcome according to the 
specific theoretical postulate chosen. The application of conditioning and other 
learning techniques in the treatment of both children and adults was, of course, 
described thirty years ago by M. C. Jones“; twenty years ago, by the Mowrers®?; 
and more recently by H. G. Jones“: *), Eysenck®’, Walton“! ™) and Walton and 
Black “*. , The accommodation of theory and practice has been most fully at- 
tempted so far by Mowrer: ”, Dollard and Miller“, and Shoben® !°, 

The following case study demonstrates that some forms of disordered behavior 
in children can be explained and treated according to well-established principles of 
learning,' without any recourse to ‘“‘psychodynamic”’ concepts and without any at- 
tempt to produce “insight” in the patient by means of verbal “interpretations” of 
behavior. 


CasE Stupy 


Several years ago a child of five and a half was admitted to the hospital in con- 
sequence of her refusal to eat and also because she was suffering from what appeared 
at the time to be rheumatic pains. 


Relevant History. The patient was the second of two girls, the only children in the family. 
The older child had _ born during the war; for the first four years of her life the father had been 
with the Army abroad, and she became the ‘‘mother’s baby’. By the time the younger child was 
born the father was back home, and she became his baby. He gave her lavish attention, played 
with her, and almost from the time she was born fed her (she was breast-fed for only a “few 
weeks”). Apart from the manner in which it was carried out the history of feeding was uneventful. 
The child was enuretic at night until admission to hospital. Frorh infancy, when her father was 
nursing her, the patient “used to hold on to and play with his collar. She would never go to bed 
without his collar . . . If she lost hold of the collar in the night she would call out and mother 
would find it for her’. 

Between the ages of 3 and 4 years the child’s appetite became capricious. She insisted on 
having her main meal at 6 o’clock, when the father came in from work. She would sit on his knee 
throughout the meal and he would often feed her. When the father was present the mother was 
rejected; this younger child was the self-avowed rival of both her mother and her sister. 

Father was the kind of person who would do anything to avoid ‘‘disagreeableness”’, such as 
violent scenes of any kind, and could easily be influenced by threat of this. Thus, going up to bed 
at night and settling for sleep were, for a long time before the child came to hospital, occasions for 
crises and emotional tension. Mother would put her to bed, the patient would scream, and then 
father would go up and read to her until she fell asleep. This went so far at times that not one, 
but several, books were read until the parents were exhausted. 

A month before this child’s fifth birthday the father spent seven weeks in hospital with an 
illness which was to prove fatal. The patient could not see him during this time but clung to his 
collar at night. When he returned home—to die—the child wanted to spend all her time in his 
bedroom. About two months after his discharge the father died; and the mother told the patient 
that he had gone to ‘“‘God’s hospital’. The child expected that he would come back. 

In September of that year the mother started to go out to work, and an aunt used to give the 
patient her meals. Quite fantastic antics were resorted to by her adult relatives to induce her to 
eat her mid-day dinner. For example, one of the relatives would put on a dance or some other 
performance while the grandmother tried to feed her with a spoon. Her appetite deteriorated 
gradually. In October she became ill with what was later diagnosed as acute rheumatism. She 
was confined to bed for six weeks; and during this time her appetite deteriorated to the point 


1The phrase learning theory in the context of this paper is used to cover both Mowrer’s two 
factors, solution learning and sign learning. 
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where she was refusing all solid foods and took only milk and fruit drinks. The family doctor him- 
self tried to feed her with a mn but with no success. Once the child commented to her mother, 
“Dr. B says if I don’t eat I’ll never be a big girl. If I grow into a big girl my daddy won’t recog- 
nize me, will he?’’ By mid-December the doctor, getting worried, called in a pediatrician who 
took both mother and child to an hospital. There the nurses tried forced feeding; but this was quite 
unsuccessful and was in any case stopped by the pediatrician. 

Treatment. The most urgent task now was to revive appetite. The case was discussed in the 
Psychiatric Department. The immediate problem was formulated in terms of simple conditioning 
with father as the conditioned stimulus, _— which the conditioned response of eating had come 
to depend, reinforcement being supplied both by satisfaction of ~.4 as well as by anxiety- 
reduction through sitting on the father’s knee and being fed by him. first step, therefore, was 
to provide a substitute for the father and to arrange a series of experiences that might gradually 
approximate those obtaining before the father’s death. In other words, principles were to be used 

t involved both stimulus-substitution and the generalization continuum. The attempt was 
to be made to replace the father as conditioned stimulus, first by the psychologist and, later, by 
members of the child’s own family, such as uncles, the father’s sister, and finally her mother. It 
was decided that the psychologist, a man, should undertake this part of the treatment, and in 
accordance with the theoretical formulation no ‘‘interpretations’”’ were to be given to the child. 

For the first week the psychologist saw the child every day for an hour and arranged a series 
of play sessions and tea-parties in a large play house which was equipped with miniature furniture 
and paraphernalia such as cooking utensils, plates and tea sets. 

Dome the first session the child and psychologist attended to the various needs of dolls and 
only the dolls were fed. The mother was present on this occasion but was excluded from all sub- 
sequent sessions. On the next afternoon, while worrying considerably about “supper on the ward” 
the child nevertheless ate a number of miniature biscuits and dr: dolls’ cups full of milk laced 
with stout, a beverage to which her father had been partial. Play with dolls and tea parties con- 
tinued every day that week with the child still ref = ony bey meals on the ward, except once when 
she accepted a from an uncle who was visiting her in the evening. At the weekend it was 
decided that the child should return home and in future attend as an out-patient. 

The weekend was spent partly at home and partly at the house of a paternal aunt and uncle. 
(This paternal aunt was later to prove an excellent mother-surrogate, in bridging the gap between 
the dead father and the mother.) On Monday mother reported that the patient had eaten a little 
solid food both at her aunt’s and at home; and while the psychologist was having his coffee the 
child asked for a cup, too. During the subsequent play sessions that week larger cups were sub- 
stituted for the dolls’ cups and full sized biscuits introduced. 

In the middle of the second week the mother reported an “‘embarrassing” experience. The 
were at the grandmother’s house the previous day when suddenly a neighbor, an elderly Irish 
woman, came in with the patient and said to the mother, shaking her finger disapprovingly in her 
face, ‘Don’t you tell me this child won’t eat; she has just eaten two plates of spare-ribs and cab- 
bage.’’ That evening the mother suffered a further “embarrassment” when another neighbor, a 
man, called and the child told him, “I don’t eat in this house. I won’t eat for my mummy.” by 
the end of the second week, although the child still refused to eat with her relatives, she had 
eaten fruit and nuts at this neighbor’s house. 

The Christmas holiday and a head cold now intervened, and the psychologist did not see 
either mother or child for six days. However, after a further week the patient was ne agree to sit 
down to meals with her mother and sister but did not regularly eat anything with them. A month 
after referral the mother reported that the child had for the first time inquired what there was for 
lunch and had eaten something of ee Eee that was available. 

During this period the play sessions continued, never less frequently than four times a 


week, with the patient often es the meals she and the aes ate together and de- 
ole 


termining their other activities. The psychologist played the role of submissive father and pamp- 

ered to her whims, until one day, six weeks after beginning treatment, when the child had him 

skipping for his tea while she beat time on a hammer-peg board, he made a note to introduce at an 

——_ moment a little more “reality” and a little less “pleasure’’ into the father-child re- 
tionship. 

The opportunity came in a day or two when the patient refused to stay in the playroom while 
the psychologist a talk with her mother. He therefore terminated the interview and sent her 
home without the cup of coffee that had been promised her by the clinic secretary. Interestingly, 
the mother was very upset at this, fearing that the child would refuse to attend again. She at- 
tended, however, and a period of retraining was started in which a different set of responses began 
to be rewarded, 7.e., responses other than those of ss the relationship with the therapist. 
She began to do little useful jobs about the clinic, such as mending books, as well as playing. The 
child continued to eat better, now taking a regular breakfast, and even to say she was hungry; 
but after three months of treatment she suffered a set-back when the mother had to return to 
work. Her diet was again reduced to fluids; and she insisted on having a collar, this time an uncle’s 
to go to sleep ‘with. The relapse lasted, however, only three or four days. 

For the next six weeks she was unable to attend the hospital because of measles and then 
chicken-pox. Five months after her initial referral to the Psychiatric Department the child was 
showing an interest in food and developing special tastes; and a month later her eating was no 
longer a cause for anxiety to her relatives. By this time the psychologist had managed to transfer 
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a large part of the father’s mantle on to two uncles, one of whom (the husband of the paternal 
aunt already mentioned) the patient described as her “No. 1 uncle’, saying that she loved him 
‘nearly as much as Daddy’’. Nevertheless, it was learned from the aunt that the psychologist 
still possessed almost divine qualities for this child. 

After seven months this phase of the treatment was concluded. For three years now the child 
has remained free of her symptoms, the nocturnal enuresis ae cleared up twelve months after 
the feeding difficulty had been disposed of. She has been healthy and fairly well-adjusted at 
home, at school, and with her play friends, despite the death of a grandmother, of whom she was 
very fond, two years ago, and more recently, of her No. 1 uncle. 


SUMMARY 


In spite of the fact that for more than thirty years psychologists have been 
interested in applying principles of behavior discovered in general psychology, 
especially those of learning, to the understanding of behavior disorders and neuroses 
in children, these principles have been largely neglected in treatment. In order to 
demonstrate that this failure of transfer need not necessarily be perpetuated, the 
history and treatment were described of a five and a half year old girl who for three 
months had resisted all attempts by her relatives, by the family doctor, and by hos- 
pital nurses to get her to take solid foods. 
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EDITORIAL OPINION 





THE STATUS OF ECLECTICISM 


The dictionary defines eclecticism as the practice of selecting what seems best 
from various systems and sources. It is a significant comment on the status of 
theorizing in clinical psychology that the eclectic viewpoint has not received more 
systematic attention in the literature. A survey of contemporary textbooks indicates 
that the eclectic position frequently is ignored, or described disparagingly. Indeed, 
in certain circles, eclecticism has become almost a dirty word as when used to imply 
a state of theoretical bankruptcy. A recent source book on mental health describes 
the eclectic somewhat begrudgingly as a rare bird who may have something but 
certainly is not as fashionable as adherents of special schools. 


Among the common criticisms of eclecticism are the misconceptions that the 
position is (a) an “hodgepodge’”’ of disconnected facts, (b) largely torn from context, 
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te} unrelated to any unifying theoretical structure, (d) lacking global perspectives, 
e) unsupported by valid theoretical models, and (f) barren as to research-stimulating 
hypotheses. A particular difficulty for the young psychologist is that the eclectic 
position seems to offer little to identify with (except perhaps the security of being a 
solid basic scientist), .e., many seem to feel more secure identifying with some cur- 
rently popular school. Eclecticism, because of its very nature, tends to have no 
militant leaders, no well-knit body of followers. 


The validity of the eclectic position may be somewhat difficult to comprehend 
for psychologists with training limited to one school or systematic viewpoint. In- 
deed, no one whose training has not encompassed a majority of pertinent develop- 
ments in the field is likely to comprehend the eclectic position. It is only by direct 
personal experience with a variety of operational approaches to personality study 
that one is able to interrelate and properly weight the contributions of the various 
schools. Thus, eclecticism frequently is the mark of broad training and experience, 
while adherence to a particular school may indicate limited training, research ex- 
perience and interests. 


In reply to claims that eclecticism is an hodgepodge of unrelated facts uninte- 
grated by any systematic theory, the fact is that most eclectics accept and utilize 
the same theoretical unifying principles characterizing the important schools, in- 
cluding such concepts as homeostasis, motivation, unification, self-consistency and 
integration, in the attempt to meaningfully interrelate pertinent facts from whatever 
source. To the degree which the eclectic is able to deal with all the operational ap- 
proaches for studying the whole organisism interacting with its environment, he 
tends to be a genuine field theorist emphasizing all factors organizing personality 
integration. 

If anything, the eclectic approach may provide more fruitful leads for research 
and hypothesis formulating than the approach of any special school whose outlook 
tends to be operationally limited by perceptual biases. The basic method of eclec- 
ticism is empiricism, the constant evaluation of raw data to discover causal relations. 
Instead of starting with theoretical preconceptions and then checking the fit of the 
facts to the model, the eclectic usually proceeds inductively, gathering and analyzing 
the data, and only later attempting to construct explanatory theories. 


After basic science psychology largely has won the struggle conducted over many 
decades to free itself from dialectic philosophical and metaphysical speculation in 
favor of a thoroughgoing objectivism and eclecticism, it is indeed discouraging to 
discover that such obsolete trends flourish unabated in certain areas of clinical 
psychology and psychiatry. Discernible not only in theoretical writings but also in 
other expressive behaviors, we find a continuing contamination of objective research 
data by hypotheses and interpretations reflecting philosophic speculations, per- 
ceptual biases, limited operational approaches and specialized interests. Even scienti- 
fic psychological journals may be rated according to the consistency with which 
editorial policy is solidly experimentally-statistically oriented or permits philosophic- 
political trends to intrude under the guise of “humanism” or democracy. We are in- 
clined to believe that ultimately with the advance of science, the eclectic and the 
theorist will be one and the same person. or ¢ 


F. C. T. 
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